


Institutional Archive of the Naval Postgraduate School 





Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations l. Thesis and Dissertation Collection, all items 


1988-06 


Design, implementation, building and 
evaluation of a Torus Double Transitive 
Closure Network of Transputers 


Frazao Sosa, Jose Ignacio 


Monterey, California. Naval Postgraduate School 
http://hdl.handle.net/10945/23159 
Copyright is reserved by the copyright owner 


Downloaded from NPS Archive: Calhoun 


Calhoun is the Naval Postgraduate School's public access digital repository for 
| W D U DLEY research materials and institutional publications created by the NPS community. 
«till Ze Calhoun is named for Professor of Mathematics Guy K. Calhoun, NPS's first 


Im KNOX appointed — and published — scholarly author. 

Un LIBRARY Dudley Knox Library / Naval Postgraduate School 

411 Dyer Road / 1 University Circle 
Monterey, California USA 93943 





http://www.nps.edu/library 



























































































































































































LI WI m var dime TEN ; ac e Na 
D n 4 + vi arsa KEER LE HEK NOTE ETE RATTE OUE | ia nao A 
NW U sé DÉI E ` ñ re ME Wee ere br PER OM SOW WAS MY EE EET OK 
ne oU] ou D N * D O a AM Mad lc "Raad er NT Yr HEER MET TEE TE pv el oa EH 
a ^ y ol A ASA TAR RE ALTEA MIRAS sams 
D TI sa 1 bi. Ha à : EV wyn ww NE NE EE NN AK TT LA BERE TEELS EO 
H fè 1 II . - ANI OR NI TACA Or DA AG Pe end de L L EG OCA 
U ro ‘ V a Rx wch LA E dr dU bl tw ww 
1 , H ' | £= Li A A W Yi LÀ MT A pensad e ec a rS Aid PLURK TE OE Id 
n U ' às a? Å 4 MA] IN EN isa y r ga ARRA LSU OE EE L 
D É. D A A Ss Be dy Ion CN? DEOR NY (ew: ET ANT EE YN W TI TW LA A kil ! 
D N Li «} ADD La las si PA PW Pen echt) AM Py vie vereri ELLEN Doebai kad 
^ D [ee a on n 4 , E E NE ry yw y MT ATLETE, PM AC Rp er t rro ler ur t 
E J LA H Bs AM g" 0 POSEN DELE DAT TT MNA nX DELE L 
o n DELI D "NEGLI US. ' eu rap ya TI TYPE NN rh py hey WA D LP Mw UN AD UN Voy TEE 
n U D V U) E ' Ve d 1 1 TW WYN DE A OM TIREN A Yr PA E ode 
H , 3 M s N w TA H are é e LT HT UN PR a Mp A dp ea AA en 
H er U AT MET E TI DN MI T NA PT] A A t La LL SL. uu a refi via ta GU 
N a N L k 4 Ab ADA ON rer OE “Tr TARTAS SA NA 
D U ' U " U AR. Rad A EA SR Fey AAS TE TERS DEWET ET ned eta 
Ya J A LI has Lm pp peer UR REED na Gar CR LAOT T T Y'r 
E D He gd my Y Y e e Y I* NR EI LL L L dn PE TEN tr yr 
A r e b "n = has Ç Ve EE ef WEE NE VER Da ERF or pri EEE ETE Re. PP Me De Be aand 
l) P F D PA = "TS Dantes ECO TE WEER DEK EER TYE KOS e mns 
d | Kg n draer Ae Geier FFF SH O O, NY 4. 9, 900 18240. Fe eeu ch we N 
. n a doa | ' R.A Ne A Vi EWECH pi. Ee A EET STA cgi = A TE TE rer Rédom er 
.. D 2 . o << “Y SNE VG A PEP GU WWW tn yh WR Py ot TALE LATE] 
n . A Au. a AS. UN? t Y tar a y EE EE TA Rëck? RUE A ANGGE TELA HON TL 
A A P 4 N E Ka Y d aa d LW ef EP FLI EMS EO LARR ET LOSEER SEEK NE YY KE Ed] LIII 
A i n 4 eyo o%s Li P Hi Lm DEA) TEL TE EET EN H 
E N t, br: W^ JA". ATAN Agric AGA, nds Rore 
n " a 1 TD PRP ' E P Pil i9 44 Hod RA Hr my c ba agapan ad 
n ' > TE) a. . a n = EE NN Y CAS. Y Y HES ka e dia 
7 1 N D n N U Bech, Sn ETE ww y OT HA EE LOL MES LT ee. GE RE I SL] 
1 ED. io A 4 å ACESSA NN NN AT UT T apang ng Ñ 
D g 4 a a H 5o ¡AA ME TEA TR A TT AAA E n PANG TER LIE TY 
n" H (si? Ae $ * $t I ki E'N. MAU wd“ md a kle MA ch so kan! MAIER ENS foa rhe 7 
, E i E 4 a e EI LL OY detw wk sh D D AGL. Y d EA e ed rr Ac derde dead) TE te NG 
; E E t 2 [ Mo ki ARTS YY Se rwy (<, lol sp VET le ur eel, 
T JG + fact U D o STI KHN i No A. XA E ARTO A Loa nd T 
" ' ' H xir a 2 "å T "TE KOU Ap se EMAN YE YO [AN ZE TOY Abad Å EU yl baba tic eae 
R D A " s “Y P LWA da kad Z 1 DEN mr M TT TEE TOL IE OT HA ata la Edi La LET TES 
E ‘ U M a LR TERE ROL DOE AE Mes TE Mere AA sab 
E H D ' : | 3 PI Qe Dee a e FC E nya GOL e JL EED ae Gis cryd ed 
n n E LU . rrr v YT Ap er idd ost Belle Ar a a ai Da a ay sp tpa 
E ED D 3 PE N D 1 wer Fre AG `. Ay TT) PA N ala NS EO KAYE KAT LEES II 
D D 5 n U ' a ES ' AY TOUS PEN PG AY LAENG SS PE ADOR, UY Wy DEE 
N A DI DD ` Er "YS LKW strate AA ama E ed Ag y 
N E " E N a x N TE VLOT VEITH IU TH VERE STOELE OAK MA 
S . Li ! N ' Cu aj TNT ' SETTORI EN E e T bad lle is ener 
a A n i U H 4g f Anh. Ros ds VIR wy eue. om ren ' m e Ls nds da 
ZR Yè IN — RI AN. pen TOUS A E 
G D E n " 1 Li A ULLE ERTS N LAT led EA EA LIG 
T N , ' WW wb T D TE A e a EA AA dw DUET 
Ae zs A A E D x L^ "T D ` bbt KE TECH LU EL Vries 
A PP " EL a * : D D EER vee Ud GA PT MU ce d 
: Å U ters B n Dë: "ek" a hc ASD AN) Dei use mL N EM to Deen fan, ri M29 f 
d sch D y Ng [E i PD "e OTIS EE. MES GT å EK LE TT ED N Kee 4 
Padi n To ma 5 u n E VAN 3 ST O Y AAA a Sa DON WD A e ec B 
SA E n E mee " BLIN a eee Rs bb us EST yate a 
"in m u ta E Ded m PM e S [n A DOELE E E eA nw oq ur Ip reae Te err De 
Š G ° ° 1 D E nae E > WI MN EG EET POLY w (Se ata Cu Y da GT Y, Y, wya TE 
| Ke n ES Å N a quo as n eee mm ep RA Li KTA DT ege CA RE? 
D DI L på m O n ' Li D D D JS me aal n A so LEE Da reina ya ETOS LE A OW bal eli MAS 
a a E TD H n n n og T E AM ate ^n U E - NY ANg PIN Pi. Apo Ed ANL LIRE rh. kat era 
: FR) D D H ME ^ n Kee? mr TV Drm m ISTA KIC ER hael! Det kä A YO Ya ed 
D a ' n w "bail sa E n P : M NND mico LEE EE dna eis 
a ñ n . to PE SE ado os EDU Week eg ei AAT ECH ode E 
h 8 a: m DH FE a H “ D DH AA te ATA f mI PE lyn LNW YO w ^mi bh bad E DEL LET 
Oe n f ^ ` n DH "n ^ `. DM PTT EL PP er De SD a 
B Dar, ^ c "før å H H DN SCH TDN TL EN J ELE moen Go AA O re A 
Tee 3 H Ñ E D P ad: 4 e P To ar il 1 ` MET ETE ED 
ac $ E n $ tote oe t^ GM, Ca Gar WY ESEL EE 
E 7 A 5 re 3 A . n N ARS KU onm qn) L Š we big ad LU ATI dd 
d å MEG U "E as " “a y "Au Ng er 8 AA LT SITES ETA LAA aper tiia et 
P n E E ELS [ORO * ¿$ d Li l ' " N ' `x d Pe Aa, de a as calm ETT DLL KÉ SEE a SE GE RTL NN Hen aged. ang 1 
GN A Ñ D ' E T an P E: lab t Hu EOD ER babab DLE 
OU x x 3 H a , HR M p a P We A A a LU Y ` vd mg Aa ki MA: E Ha a rad bng hd N der 
É É H E L " 1 , P H L aa" DM MM * DES ge tegne, takene 
" : x tir n 3 m a D ' e A " re GR ed DI Re baga gl Ht. PR Ge Ek * e 
EN A P 5 i 5 eh E q NG Lë < wen y 
EE SC, s | 3 | YH NAPARAN KEE 
A a s Der} A a H « DR * 3 E ME LI : U ao Eu ee dr pu e V LEE] a O 
e É o Cas ° I x Ew A5 v t ts. ir LE ymd aM oi ei AKE EE ho 
` DH no E n PE d - `n ves "P u år Sas Hie i Åre d Kaha ty EX Vi eT em 
š etry sa q D Ho D e er DU ` C = Ni "ag 35 sa Mad R A KE EE Er res S 
H L E E 1 a A a U H ki Wutee Mae mega YS ae P: ak cy Do pre dn del Aë sang Eë yi 
n ^ 5 a OD UL en P ` td n -— NI TR ' . m NL ETE GE DT ede ES ar np ELS oe | 
E . Gë E S P : Ps q Bn FU E RON. ^ MR TE TA IE Oe YD De aret, a PE N) Deed 
PEE A D Ñ D B L H H U p E A Bee n yon oe H ? N å ) HL KM AA E der A. GE 
e E De? = T n E e UCH m4 do OM RT A 
" an Na N UN D P DÉI Pee * E a m < Jn yo A ie KA een 
n k us yn ch GEE dr da 
c 3. ESCH Ae Ke Zäit Do ru 
a S" (fs PA mv a 
me ago ndn K an he So win d LIT 291 nr aima = 
>: | Yd WE su ee 
Port RAS UIT e hen KEL ere 
n "k ty Sa E gen bett 
z ON KE EE DOE a 
Para PW rT a y” 
eg "feriri AB aite 
EM SE lo we E Fire EXT. ate 
E L Ye Se bid y'n n 
ke C E LP Ee LE 
A > we cb ih, 
ET Ee 
E oi WA od n md beet vs) 
Gå OE ed be Jet dé 
5 or Men 
si GERD EY. ` 
a FERME! >; LZ ONN t 
DH kad = w»! 
K > ODER ^ E 
bla 
i PER sd gee S 5 
a AU 
MAT N AA 
E n PER Té 
1 
n 
Cor 
1 
L ' 
A s 
ta ER M 
"on EO 
D sit 
E E 2 EP = 
ng LI GEK e 
l) 3 EFE 
i magsi 
P CECR 9h y Sei bé 
H LERE PS sørg. 
E $ Z es Ar 
D > 
PSG 
.. - 
LE" 








Xs 
TT på 1 
, nw 


TC A 
Art 






sta PP Eu $. 
PE ART 


er 


5 Pad da du 
“sa LI 
TEN DEd 


oM 




















NOTE? 
g "T 


FI ET 
BC E: 














































































































: E (Sa ge D O ran ` S. WYTH A 
1 E 00 a. yet ra t fc, n ty m Da o ze MEH ANIG ee 
TD Fn 4 4 nun TEIT PLU Hon LAN er ma Dur ¿"de A se são q 
' a 1.0. K FE] D H A HA LÉ DU a oe. a Å dd Fi 4^ EM e: Jes] yo 
O PRA o tat pay  BAxont ee MR C P A rd EIDEM Kë 7 ei E 
D RED AS MEULE ME EE 
MYD EET D DAD IE MA NG TALE AS ET TR IG 
E OUT RT AT s^ "LE NL LLL D D 3 d 4 
D a ñ vu" ke ETA N E e gutsy) cots ETT 
U ung U LO H DW A H Li be ZO WA H 
i d L t O HS D "I Le ë å LS o ..r 
S E, TC ber) GANZ SALA OD k PA? TOL ST aftel 
IE LE SLAE] DL "i ARE A TO LA Dr ur EER IT) Q 
b. 


U 

m fi: P Et 
SEH SCENE ée, DEET oe 

wn) gryno EL m d 
Cat Pn Core rx PRA manas E 





PARRA ET EE FT HEET rer 
U OE POP ar ab. 44050 y“ 7 r,Wo 
Mar EK 


PE AIS 
ae 











TET APA i ESE 
WE RIA TM 
og tng Faby ou 









































Se ETES wy 

EIE TEATER DNE TE Y] PIN E ET 

. ey n "MTM ME EP D E DOM D Du E "Tn n På CD NA PR m 

U "ege Ae N DET 4 Ñ LET M D m^ (O oe Za e Uwe TT 
AAWL ` e se a MT TA ee vr 

Ne EER WE Å d Er e Ups EA ATO CU 


ago PAYER TAE VES 
“ale er Og EET 
- BE Pate rios 

H DEEL 






MEES ETER LIEFIE IS, 
EP EA ek 

n . sat para 
TY" MACH E ke f Lee = o 
H SKS l E 1200 Par 
TEE TAP 
NG: et ee Ar KEY 
LIE EET EE ID 
bat yo Eer, "I 







LN KA SI 
"us „ee... 
OM? "E 
! N 
Tr 













KG 4 å, E 
gU [P] IE 
ya 









n 











H po ner ar 
OS ATA C her Mire LAUR M PA gr deled 
mn "UP eet MAM PE på P san dd PN YD kn 
TEIR EER FIT ans il GETS paro d 
“ a ta PN "e. ED ve E uo. TT] e FEA 

TN MOT SEDERT AAR Og rn EET DE did 
ETTER T EA LL $. a H Rad n We sigo pi de ras PE 
ALL Ki YOU LON avwe sec ae T LS CTI ea "tY "Y 
win (SE TE TT nu "m P MERE TE ET Lr rM m" mere es ap aid we 
TE TOE ME EG "P HE EA EN E a ak ai Py ELE Li re 
if a oh re LW at WEEN UD UT FM dd do É SE D roe pire DLE EES hadi 
pa 



























































OOR ADRES D “o E PLAT H NN. '— —— NEE L WT e 
H D 28 6 SL pha "EP "mU ta'w, LTL RIIT (EH en wind. a PE gl cb 
TA A O , r y tz, On OR TO LLC Ph AL ELE Kean 


aa OT ET ETE di E. CP EE Me Wt mer te 
ah p PT hi Ad Lo r mas GA EA E e AAA ASAS neeg 
t ng "n? VETT fe a Y» DER TT $y o6 FIL rn... LIIS Ha Dii nic da 4 
AE O TD TER bia sea 3 ' DNI A E AT LAN Lane IW TT GT KEN TE di FELT 
Ar MI de y LET ' PR NO a tee Ste TEST ra A 429. RE E Gu“ a el, 
3 D ry DAT m] suas KIDS LAORET Pn TH se pero psoe PELA 





AE DX T 
e. ETTER EE py 






















LO ay EE EST LI 

















































































































H 


. 4. ? ay, PD ND motos al AI CO nd ae "Ce prx L PALA A A 
ee a IRA en AA EM OC "mm LP Leid 
. 1 oa Wes 2a 2.0... HE: wlang Pi. é as AA Onè pe L (UG ma DOE IT TE N B UTE Art TE EETL IG dd 
na m " E "nr N , 3 Wea Wei Gr > ETTE ¿DA e A Ito .. [t P “ort Pit EE AR EG ELS 
DN E. T a yon 4; TT IT EDS ER AW FA ETD EL MN e Ch 
D e ER LAM dy bota d LTD ET 3» AGE TT" Ka Pi PAg DO do yani 
SILA > > "y n Eg WNION AS) D ny nns DE DES TED TEE ni ei ET TEL] 
- .4 de . NEE EN) re $ er; BA VAN um UT E IT P P 
PL ON A dg i] på GO, Jara Mer. "m A s en A.Q FED s adi ei EDE my TEL zZ 
. BEA EE Dur ur; (LTL DIS guto. A Mr ra MYR NW A A E CR N TAT SDLT 
Kaes > sa .. d Sr, D: E TAD Vë BR, E et SH wre DAYUN SAWA IT AAN ER P LT PERE UN 
SAN BO NA me ` e. YT D TT TEE EED EIE 4 vo TH CORTS I kee 
ta «n p b V la. A AW : D. IT RI LA NGAY PI EPI PA] re FEIT TEE DT EIE IG al 
OU e t Pen ODE A n H E Dn pi a any ret. ET AMY LIE LEN TOE ETE FEED STAT ad 
rn roy OE md Ave e EU DIE EPA TE ENG HEAR ALES vue TATWS 
sko abo msn Wd tos ELEME PD b B ade ^ $ Partir: ee gau je e kin E) DR (KN WH NEE EE 
O el d EE En OE EE so n whe rs ede A idos APE TEE id PAO IS AUU LL 
a*n uu O Te DNK (va PE AA) WEAD re rk 
LVI dite) vrs DF å Se EEE joy KAN pee TT LERE IE ORREL EES 
fdp LN RV 46 Pa MA LEE A en Ki e Pas ygan rg pad va Me se. HE 
Se pa LUE ELE MI E 34 DIN chat d 1, 10-06 LLL MG PIT Ted vwe shp py FI LIT A SPS 
J "a La x a? ADJ e“ > TS od AUD EI nw wy pan pw en "MAITH 
a MA; LCD PT OD gi VELA Pan in ae gt eh ee: d ATLETE "TOY o 
OE ED m y SES DE sa Sure arias AA A dls 
N VV i arn EE YL 9 T D /'y*p ta? ER yr atv 
OO ED e ond de. DÉI «wd. ETE A AY 
iv 11 Liv dias. Y UL LON OE tf? GA et AR EE dd Me PR Pg re | eia mum 
KP D Va M ALTER aa dle 7 Y SD 
' 2 A E EE E TI pr Pra. stn a beid AGA WIKA eed DM DE SEE TI Y Kä oy di tet MAL vg 
a A ute ET CN Vit mont ETA tpi pm AU 
LEG "IG An E GL LG be L Me 
M 


TM LE pè. Kéi DE TM 
MET P AS n 
Pao did o dd G 
CHIESE EET 
de Y Di Katia (d a 
rs MAL gie) 3 





iv 

A UN od 
sr den ASI OT GT del i i de 
L E ON Ave FEE E 
H sy kdo fy Zë NIT a 
bed TU Sab GA AGT YR men Por 

DE ER EG EE Y 
L n a gin e Ke 
AL oe ae ev) Part: n N S EIE AD 
Mon) DE Lé 






UN Kik? 
MARK CR e ahn e dän 
vi TEL DEE at 
MINAS E LAICA 

AA EEN # OPEN pA yoy 

"T k T le bor s et ETES PA VG 

DL “i 12 OU EE SEE T 

Pa LAT BAG EA d Y aW A y" 
LEE CALEL 














































DP TP LL run EMIF UA JE rd JE dud 

E je A) tg y^, X RETAIL (A VI bo S Ar yra IM HAS) 1 ER EENS Unes 
MEEN m LE RAS IN RA wu pr CAU ELE E 

tai » H LSG (eh EA) Waw O edt eda d E aa YU. Je TENG e yaren jo VA AP yo ada 2 
" NND e ANNETTE ka IDs TL MEL ses EEDEN 

P n Ju je v i TR EII a KAR cr a ERG tan LIES 
ro Vo a ^A xo | WD 2X Py y^ yn > TED véi kk ue u 
sa D at. MENT OUT Kn d SET rnb en m operta a PL ke WE od dad å 

H Wa te EDE ER WA GË Ee MP Re MAL Ed ONE Pep ALT re 














NAVAL POSTGRADUATE SCHOOL 
Monterey , Cailorria 








THESIS 


m 
f £ 
j j { 1 


E 


DESIGN, IMPLEMENTATION, BUILDING 
AND EVALUATION OF A TORUS DOUBLE 
TRANSITIVE CLOSURE NETWORK OF 
TRANSPUTERS 


by 


Jose I. Frazao Sosa 
June 1988 


Thesis Advisor: Uno R. Kodres | 





Approved for public release; distribution is unlimited 


1238216 








SECURITY CLASSIFICATION OF THIS PAGE 


REPORT DOCUMENTATION PAGE 


1a. REPORT SECURITY CLASSIFICATION 1b RESTRICTIVE MARKINGS 
UNCLASSIFIED 


- 
















2a. SECURITY CLASSIFICATION AUTHORITY 3 DISTRIBUTION / AVAILABILITY OF REPORT 
4 LASSIFICATION /DOWNGRADING SCHEDULE Eni release: 
Lu GS Sieh OU Eon puIPrglted. 1 
4. PERFORMING ORGANIZATION REPORT NUMBER(S) 5. MONITORING ORGANIZATION REPORT NUMBER(S) d 
| ' 
| 4 
| 6a. NAME OF PERFORMING ORGANIZATION 6b. OFFICE SYMBOL 7a. NAME OF MONITORING ORGANIZATION 
| (If applicable) 
| Naval Postgraduate School 33 Naval Postgraduate School 
E ADDRESS (City, State, and ZIP Code) 7b. ADDRESS (City, State, and ZIP Code) 
Monterey, California 93943-5000 Monterey, California Suo SYO 
Ba. NAME OF FUNDING / SPONSORING 8b. OFFICE SYMBOL 9. PROCUREMENT INSTRUMENT IDENTIFICATION NUMBER 
ORGANIZATION (If applicable) 
Bc. ADDRESS (City, State, and ZIP Code) 10. SOURCE OF FUNDING NUMBERS 








PROGRAM PROJECT TASK WO«ín UNIT 
ELEMENT NO. NO. NO ACCESSION NO. 
TITLE (Include Security Classification) 


DESIGN,IMPLEMENTATION, BUILDING AND EVALUATION OF A TORUS DOUBLE TRANSITIVE | 
ECOSURE NETWORK OF TRANSPUTERS. 


12. PERSONAL AUTHOR(S 
SI Erazao sosa, Jose I. 


3 


EM TYPE OF REPORT 13b. TIME COVERED 14. DATE OF REPORT (Year, Month, Day) 15 PAGE COUNT ) 
ENT té: 's Thesis | FROM TO — — — 1988 June 168 


io. SUPPLEMENTARY NOTATION. The views expressed in this thesis are those of the authorj 


and do not reflect the official policy or position of the Department of 
Defense or the U.S. Government. 


17. COSATI CODES 18. SUBJECT TERMS (Continue on reverse if necessary and identify by block number) 
FIELD GROUP SUB-GROUP Transputer, Parallel Processing, Microprocessor Networks, 


Po [| | OCCAM Programming Language, Heat Flow in a Plate 
A ë 


19. ABSTRACT (Continue on reverse if necessary and identify by block number) 






Currently the design of highly parallel "supercomputers" is one 
of the most challenging problems in engineering. : 
The purpose of this thesis is to describe how the problem was | 
approached in the design, implementation and building of a Torus 

Double Transitive Closure Network of Microprocessors, using the 

T414 Transputer as the basic Unit of Computation. 

Also compares the performance of the evolved model, from one 

Transputer to the final stage of sixteen Transputers running in 

parallel. All the programs and examples presented in this thesis 

LE np lemented in the OCCAM2 Programing Language, using the 


Transputer Development System, D/00c, BETA 2.0 release March 
Pos compiler version. 


20 DISTRIBUTION / AVAILABILITY OF ABSTRACT 21 ABSTRACT SECURITY CLASSIFICATION 
[4 UNCLASSIFIED/UNLIMITED ÜC] SAME AS RPT C DTIC USERS UNCLASSIFIED | 


‘2a. NAME OF RESPONSIBLE INDIVIDUAL 226 TELEPHONE (Include Area Code) | 22c. OFFICE SYMBOL 
Uno R. Kodres (408 ) 646-2197 | re 
ID FORM 1473,84 MAR 83 APR edition may be used until exhausted 


SECURITY CLASSIFICATION OF THIS PAGE 


| 
All other editions are obsolete ŵ U.S. Government Printing Office: 1986—606-24. 


Approved for public release; distribution unlimited 


Design,Implementation, Building and Evaluation of a Torus 
Double Transitive Closure Network of Transputers 


by 


Jose Ignacio Frazao Sosa 
Lieutenant Commandér, Venezuelan Navy 
B.S., Venezuelan Naval Academy,1974 


Submitted in partial alt ME HEN 
reguirements for the degree of 


MASTER OF SCIENCE IN ENGINEERING SCIENCE 





ABSTRACT 


aurrentiy the design of highly parallel "supercomputers" is 
one of the most challenging problems in engineering. 

ine purpose of this thesis is to describe how the problem 
was approached in the design, implementation and building of 
a torus double transitive closure network on 
uMcrODEOCessors, using the T414 Transputer device as the 
Basic unit of computation. 

Also compares the performance of the evolved model, from one 
Transputer to the final stage of sixteen Transputers running 
in parallel. All the programs and examples presented in this 
thesis were implemented in the OCCAM2 Programming Language, 
using the Transputer Development System, D700c, BETA 2.0 


release March 1987 compiler version. 


ME 


THESIS DISCLAIMER 


The reader is cautioned that the computer programs developed 
in this research may not have been exercised for all cases 
of interest. Wnile the programs are free of known 
computational and logical errors, they can not be considered 
validated. Any ap pi tea on me these programs without 
additional verification is at risk ok le me 
Many terms used in this thesis are registered trademarks of 
commercial products. Instead of attempting to cite each 
occurrence of a trademark, we list ali registered trademarks 
which appear in this thesis below the firm which holds the 
trademark. 
INMOS Group of Companies, Bristol, UK 

Transputer 

Occam 

INMOS 

IMS T414 

IMS B004 IBM PC add-in board 


TMS B008 


iv 


TABLE OF CONTENTS 


i. A NTT Ver so o e RER 
OEI PE D J eS... ve e esee es s. e 01 
ee EG SMAA Group at the NPS 2... .......-. 01 


2. Considerations and Terminology about 


i euc 2 L Leia o. cc ee 6 as 6 6.6 46 6 ae 0 0 oe oe 02 

ETE CE RSS cravos wo 08 
NS EED so... NN. oos see see sw ese ese 08 

Ds OEA NO EE Tee. 09 
GE TREN OR RA L LH re 0 00 10 
II. DESCRIPTION OF THE HARDWARE USED IN THE NETWORK...... 212 
a. EP GATION OESTHE PRANSPUTER IMS F4dld......... e E 
IEA E SAS NN... Tear ar um 13 

Ao Es Ese adders y .......... ... 15 

CERCO mt o e 1 7 
ENE EEN e a um au o 18 

EN Emo cT... ee 19 

6. External Memory Interface and Events........... 20 

ma ssa ERE E rrt mtm mg 2 
CN OEI OS... uu... Y UU UU on 22 
EL EG di TEL BE ADD-TN BOARBDI A rex t R8 23 


1. Initial Requirements for the PC Add-In Board...23 


Sere lee OO poms OAC rsrs aa 215 


III. DESIGN AND EVOLUTION... eee e s e e e e c a oa N 2 


A. THE MODELING PROCESS. . 59559. 772000019 JI 29 
l. Description of the Problem. EN 27 

2. The Abstract Model................ PU P P P I Y 30 

3. The Transformed Computational Model. re 30 

B. NETWORK MODELS AND EVOLUTION. ra 32 
1. Network Classification. ND TD 32 

2. Model Evolution............... pp EE TD 33 

3. The 16 Transputer Network Prorotype....... a 38 

4. Expandability of the Model..... ca 33 
IV. EFFICIENCY CONSIDERATIONS.......... we DD 3 Y iie 4l 
A. INTRODUCTION. 2.226 0666 6 6 ale SNE Al 
1. Generalities............... se RE EE N 41 

2. Terminology and Concepts.. as. RANK 43 

a. Power and Energy... sak W —— — 43 

b. Efficiency.....«« <<. a PP TT T N 45 

3. Applications of Efficiency Analysis. aa 46 

B. MAXIMIZATION OF THE TRANSPUTER NETWORK... P P E 49 
Generalities..........«.. ^ 2 N N 49 
Maximizing Link Performance. OMM 49 


a. Decoupling Communication and Computation NN 
b. Gather Together all the Communication 

PrOCESSES...... 0a SI EE N 50 
Cc. Large Link Transfers....... aaa 51 
d. How the Boundaries were Passed in the 


Network............... eile e eIae.ae.ueuN.EIÍO C W S Sa 


vi 


m 


V. COM 


A. 


Be 


w 


Lu O m tl 


I. 


VI. CONCLUSIONS AND RECOMMENDATIONS 


A. 


BS 


SE 


APPENDIX 


APPENDIX 


APPENDIX 


APPENDIX 


APPENDIX 


BIST OF 


SEE FT OE IE SYSTEM... dd ese des ds es es see se 
PARISON OF THE NETWORKS PERFORMANCE... . ss se se ee 
ARE WE USING AN INDUCTIVE ARCHITECTURE ? ........ 


GEEN TENT EVALUATION. ET... I... UT o 


sn NA, YD ea [ed seco cs, 


TRADITIONAL APPROACH TO THE SPEEDUP ESTIMATION. 


Sy (Oe JD. Sd ns er 
GM ee e NN oe ese ee see ee 
reo Pa MAI ZONE. EE oe oe ee eed ee os ee see ee 
JOU IE PARALLELISNAWAS ACCOMEPBISH. . Ha ss wa a o 


PAANAN GANG ARA AE ee es ses ees see 


AS EE OR OO N N oe oe se see 


ES SIE > TEE IRANSEUTER dd. sees ss esel 


EER MEME aEIONS seen een 
A O ISTEANSPUTER NETWORK SOURCE CODE... .... .. +. 
B PM Transputers "NETWORK “SOURCE CODE.......... 
C GU On SEpuce'r 'NETWORFSOUROCEFCODE........» 
D N ML ars PAL SE ANETWORKISOUREE CODE? «eee won en 
E EXPANDABLE PLACEMENT FOR 16 TRANSPUTER 

NE ORKE OOS. N ED 

ERE NNN ETT TT ea e 


VII 


* 6e e 9 9 9 9 9 9 9 9 9 0 o 8 8 0 e e a o 


pa 


LIST OF PICURES 


IMS T414 Block Diagram... . ss ae e 12 
Transputer Register... . cc coco REKEN 14 
Transputer instruction Formar WM Mg 19 
Memory Map... «cacos ce TG 20 
The Transputer Links..e.c NN AE 22 
Block Diagram of B004...- MM CTI 24 
The B003 Board... ses soes EE ENE N 26 
Heat Conduction in a Square Plate ...... —— RR 28 
Abstract Model...... O O NN 30 
The Transformed Computational Model a 32 
Model I....... 9999 etx EU N N 34 
Model II.............. 9. c c P P Es 35 
Model III in Its Different Sizes........... aaa 36 
Data Flow in the Network... Rs aaa 3d 
16 Transputer Network Prototype... ca - 38 
01 and 04 Transputer Network Connections... ....r.m 40 
09 and 16 Transputer Network Connections... o.m 40 
Energy for a Serial Adder and Parallel Adder....... 47 
Boundary Exchange................ DI D N 52 
The Inductive Mesh Architecture. ÓN 55 
Efficiency Comparisons... -<n nee A ANN 58 
Relative Efficienty.......... o. ea Gi 
Various Estimates of Speedup and our Results....... 63 


vi Di 


Throughput for Different Grain Size in a 04 and 16 


ek ea NO nranspucers AI 69 
Meme update Mechanism in the Network............. 72 
DN ST E... ES VE. rre» a a p 
Mas Tin E A a R e a 76 


ds 


LIST OFZFZBEES 


Prototype Energy Results ......... PP S PP P P Se 
Efficiency Comparison for the Networks aaa 60 
Relative Efficienoy... EE EE e D p D CW SSS o 
Tnrouahput Results............ Aa Go 
Throughput and Relative Throughput... asa 66 
Throughput Results for Different Grain size ma 69 





ACKNOWLEDGEMENT 


This thesis is dedicated to my wife, Norma for her support 
and understanding, and our sons Aaron and Guillermo. 

To my thesis advisor, Professor Uno R. Kodres and the second 
EE der Major. Richard Adams, I would like to express my 
thanks for all the confidence and support, given to me at 
EN ie when T needed. Finally I would like to thank the 
technical staff of the Department of Computer Science, 
especially Russell Whalen, Rosalie Johnson and Mike 
Williams, for all the aid lent to me during the lab research 


for this thesis. 


sm 





TOEDNNIRODUCTTON 


A. BACKGROUND 
Ie ABGIS Modeling Group at the NPS 

The research interest of the AEGIS Modeling Group at 
the NPS, which was created at the late 1970s, is to 
investigate any possible alternatives to replace the U.S. 
Navy's mid-1960 design AEGIS COMBAT SYSTEM, and the main 
focus of attention is the AN/SPY-1A phased array radar 
processing unit. 

Bearing in mind this objective, at present in the 
Transputer Lab, the main thrust is dedicated to exploring 
pae possibilities that the Transputer, a VLSI microprocessor 
developed in the United Kingdom by the INMOS corporation, 
could have in the update process of the AEGIS system 
BEN n use on the U.S. Ticonderoga class (CG-47) 
ezuisers. 

At present the Transputer Lab at the NPS consists of 
five Zenith PC with B004 Tranputers boards incorporated, two 
EUROCARD BOXES, one B001 Transputer board, one B002 
Emansputer board, two B007  Transputer boards for graphics, 
four B003 Transputer boards with T414 Transputers and two 


B003 boards with T800 Transputers. 


2. Considerations and Terminology about Par MA 


The design of parallel computers is a new frontier in 
engineering. Since the device and technology is not expected 
to increase computing power as fast as the increase in 
demand, novel parallel architectures need to be designed. 
This design is exciting and important to the future of the 
computer weapons oriented industry and the national security 
research projects in this field. Also as with most new 
frontiers, it is often wild and chaotic due to the ki 
data and methodology to compare the many good designs 
already in existence. 

To help the reader to understand and get a good grasa 

about parallelism here we have some terminology. 
We will start with the basic discussion of terms and 
concepts in computer architecture. While the readers may be 
familiar with the terminology, some words were used 
differently, therefore it is worthwhile to have a concise 
statement of our use of the word. 

We define a processor as a device able to be 
programmed by a user to act on some data, a procedure as a 
set of rules that a processor can follow to modify that 
data, and a process as the execution of the procedure. The 
Transputer is a microprocessor which includes a processor 
and special instructions as well as hardware to provide a 
maximum performance and optimal implementations of the OCCAM 


model of concurrency and communications. 


The OCCAM programming language is the first language 
to be based upon the concept of parallel, in addition to 
sequential, execution. It provides automatic asynchronous 
communication between concurrent processes OMNE Che 
assembly language of the Transputer, because the Transputer 
executes the occam programs more or less directly. 

FEES puter system 1S a nonempty set of Transputers 
ieruding support components to connect them. A parallel 
Transputer system or Transputer network for short, isa 
EE EL ON Of two Or more Transputers that is built to work 
iN parallel. A Transputer network is no more powerful, in 
terms of Turing computable procedures, than conventional 
Bengsüuters. We Can Characterize the networks of Transputers 
by what they can do efficiently. So we will have two 
fundamental types of Transputer networks: the special 
purpose network Irons pure rs designed for Spee it 
applications and the multipurpose  Transputer network which 
designed to execute most Turing computable procedures 
efficiently. In this thesis we will refer to a multipurpose 
Transputer network specifically designed to explore network 
programming with shared global variables. 

The architecture of a Transputer system is the view 
Of the hardware seen by the (systems) programmer. Two 
machines can have a different architecture if a programmer 
can see a logical difference between them. A paradigm is a 


Bet of architectures based on the same principles. 


The Von Neumann paradigm contains almost ai 
multipurpose computers. It is the very well-known paraaan 
jig nre a controller: data, memory and (I/O) are 
sequentially programmed in a fetch-execute cycle, || and w 
contains move, arithmetic, control, I/O, and also on 
instructions. The implementation or organization ISS 
block diagram of the computer which shows its memory 
processor, I/O and other components, and the realizatorii 
the actual hardware of the machine. We will focus on 
paradigms of parallel computers. 

An architecture or paradigm is parameterized if, in 
the view of the programmer, it has parameters that describe 
it. Parallel computer architectures may have a parameter, 
such as the number of processors or Transputers. We can 
characterize parallel computer architectures as bounded if a 
parameter such as the number of processors can be 
efficiently used, and is limited or inductive if the 
"inefficiency" of the machine follows some reasonable 
(e.g., sublinear) function of the parameter as it increases 
inductively (e.g., as we increase the number of processor 
from n to n +1). In this thesis we are basically interested 
in the inductive parallel architectures. 

Two other parameters are the number of instruction 
streams and the number of data streams. A single 
instruction single data (SISD) stream computer is in general 


a Von Neumann computer. A single instruction multiple data 


(SIMD) stream computer system has one instruction streams 
(Procedure) simultaneously operating on multiple data 
streams (data) in separate processors. 

EE nuns p= instrAEKFION multiple data (MIMD) stream 
eomputer has a plurality of different instructions stream, 
each operating on its own data, we focus on this last type 
Em this thesis. 

For our purposes a plurality of procedures that are 
cooperatively executed on a MIMD Transputer network is a 
MIMD Transputer network procedure, a MIMD Transputer network 
process is the execution of a MIMD Transputer network 
procedure. 

In a MIMD Transputer network, the process is clearly 
a component of a MIMD Transputer network process which is 
executed in one of the Mean pue ES where several 
Transputers cooperate to solve a complex problem or operate 
independently to solve different problems. We will be 
Encerned with the efficiency of running a simple process in 
a MIMD Transputer network. 

The programers may see a machine that is quite 
different from the hardware machine, because the functions 
available to him are augmented or modified by software, 
microcode or hardware. For example, a MIMD machine may 
appear to be a SIMD machine by means of the software that 
implements the synchronization of the processors. When a new 


machine "architecture" appears due to the use of software, 


microcode or hardware to change the view of the machine, we 
call this appearance of the hardware to the programers a 
virtual architecture. A virtual shared memory system can be 
created by duplicating information in local memories, so 
that when a producing process writes a new value into its 
local memory, the operating system then generates a message 
to all the consumers of the data. The local memories of each 
Transputer in the network contain the duplicated data ready 
to be consumed by each consumer in its local memory. In this 
way, we have the illusion of working with a Transputer 
network which physically contains shared memory. 

Another interesting concept is the communication, 
scheduling and synchronization mechanisms between 
cooperating processes to in a Transputer network. One aspect 
of this is the granularity of the architecture. ar 
granularity architecture is one such that communica— 
scheduling, or synchronization occurs within an instruc uM 
such as in the fetch-execute cycle of a Von Neumann 
computer. (e.g., the Transputer OCCAM programming language 
with its primitives processes send - ! and receive - ?). A 
coarse granurality architecture implements these operations 
in terms of instructions as a whole. This definition beleng 
to the architecture and must not be taken as the granularity 
concept for the parallel programing. Granularity in parallel 
programming is a commonly used measure of parallelism, and 


is an indicator of how much computing each processor cang 


independently in relation to the time it must expend 
exchanging information with other processors [HOMO87]. Then 
a fine-grained procedure spends relatively more time 
anc a chan Calculating, in relation to å coarse- 
grained procedure. A second related aspect is the degree of 
Cs lioo A loosely coupled system uses the approacn to 
communicate between simple processors, while a tightly 
coupled system uses data  *ransfers within the instruction 
cycle to provide communications between them. Tightly 
coupled system generally require that each simple process 
has a fairly extensive knowledge about the other process, 
while loosely coupled processes may know very little about 
the other processes. (Knowledge is either an explicit copy 
“ne “data that controls a process, or an implicit 
mechanism such as compiling the procedures from a common 
source program and running the process in "lock step"). 
Generally loosely coupled systems require handshaking as in 
the case of the transputer networks and the tightly coupled 
system depend on a common system clock to assure the correct 
Eoupletion of a communication. 

A third aspect of communication and synchronization 
is the nature of paths between processors that implement 
these operations. If cooperating processes have direct wires 
between them, as in the case of two Transputers connected 


each other direct operation; if signals pass through other 


processes, it is indirect (e,g., the case of a network of 


Transputers in which for instance the first transputer of a 
pipeline will send a message to update the data in the local 
memory of the last tranputers of the pipe); and if signals 
are handled by additional hardware, then it is switched. For 
switched communication, scheduling, or synchroniìzatlonw 
interconnection network is used. In this thesis we focus on 


the indirect case. 


B. TRANSPUTER OVERVIEW 
la The Tran ELE 

The Transputer is a computer in a chip - a processor, 
complete with storage and standard external interfaces. It 
is a key technological development, because it enables 
information systems to be designed at a higher level of 
abstraction than was previously possible (this concept will 
be discussed later). 

Because of its importance, the word "Transputer" has 
been coined to describe the computer on a chip. 
The Transputer focuses special interest on the transfer of 
information across the chip boundary, rather than er 
processing of the information within that boundary. The 
powerful concept provided by the Transputer links, is an 
attractive characteristic which makes the Transputer very 


suitable for building parallel networks [DASP78]. 





2. Programming Languages 


At present exist compilers for Transputers in PASCAL, 
C, FORTRAN, and ADA (this last will be available for the 
fourth quarter 1988) but these do not have the capability to 
moi tc Ene intrinsic parallelism of the Transputer chip and 
N: ca not take advantage Of Ths Communicacion model used 
bye tne Transputers. 

The OCCAM language "understands" paralleli.n and 
sønnunication at the very lowest level, allowing the 
designer to describe and control the use of parallelism in 
the system. Other languages, regrettably, do not provide the 
needed facilities; ADA, for example, does not, since its 
semantics are those of multitasking system (i.e., comprising 
one or more processes which talk to each other through a 
shared memory), this implies that a multi-processor ADA 
system needs a shared global memory. Other languages have 
equivalent assumptions; any language which provides 
semaphores, for example, is assuming a shared address space. 

OCCAM is a language designeq to make the 
representation and control of parallel systems simple and 
eemerenensible. In addition, it provides most of the 
facilities that a user of modern block-structured languages 


like C or PASCAL would expect. 


As an example of how the OCCAM language provides for 
parallelism is that of the transputer processor which 
provides instruction set support for multitasking gm 
interprocess communication. The model used is that of OCCAM 
in which the keywords PAR and ALT and the communications 
operators ? and ! are implemented as instructions. This 
makes the  OCCAM parallelism very fast; a PAR costs around 1 
microsecond per component, while the execution time of a 
matching ? and ! - including all the scheduling needed is 


about two microseconds [INMOSJ88]. 


C. THESIS ORGANIZATION 

The rest of the chapters of this thesis were organized 
in the following fashion: 

In Chapter II we describe the hardware used during the 
development of the model, including the  Transputer board 
used to place the I/O handler, which is internal to the PC. 

Chapter III presents in a sequential and organized 
fashion the "growth" of the model from one transputer 
through sixteen  Transputers, which is the final stage of 
this design, focusing on model evolution, flow of data mu 
expandability discussion. 

In Chapter IV we approach the subject of efficiency 
related to parallel networks and some key ideas about linear 


speedup and linear and parallel performance. 
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Chapter V is a comparative study of the efficiency of 
the model among the different sizes of the transputer 
network. 

Chapter VI discusses the results obtained in the chapter 
V, and gives some recommendations about what should be the 
main goals of the AEGIS Modeling Group from a personal point 


of view. 


d 


II. DESCRIPTION OF HARDWARE USED IN TRENET 


A. REALIZATION OF THE TRANSPUTER IMS T414 

The IMS T414 was the transputer used in the design of 
the Transputer network called Torus double transitive 
closure, It will be depicted for hardware description as 
well as to gain insight in the functional characterist ea 
the Transputer chip in general. The T414 integrates a 32 paw 
microprocessor, u our standard transputer communications 
links, 2K bytes of on-chip RAM, a memory interface and 
peripheral interfacing on a single chip, using a 1.5 meg 
CMOS process. For convenience of description, the IMS TITAHE 
operation is split into the basic block, shown in the Figure 


2.1 [INMOSD86]. 
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Figure 2.1 IMS T414 Block Dies 
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1. The Processor 

MI Niet processor contains instruction processing 
iuc rnscruCLion and work pointers, and an operand 
register. It directly accesses the high-speed 2 Kpyte on- 
chip memory, which can store data or program. Where larger 
amounts of memory or programs in ROM are required, the 
processor has access to 4 Gbytes of memory via the External 
Memory Interface (EMI). 

There are only six registers in the transputer, and 
Flat is due to the availability of fast on-chip memory. 
These registers are used in the execution of a sequential 
process. The small number of registers, together with the 
simplicity of the instruction set enables the processor to 
have relatively simple (and fast) data paths and control 
logic. The six registers are: 


The workspace pointer which points to an area of 
storage where local variables are kept. 


Bhe Instruction pointer which point to the next 
instruction to be executed. 


The operand register which is used in the 
formation of instruction operands. 


The A, B and C registers which form an 
evaluation stack. 


BS 


The Figure 2.2 (INMOSD86], shows these ee 3 
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Figure 2.2  Transputer Recusters 


The A, B and C registers are sources and destinaticns 
for most arithmetic and logical operations. Loading m -ESSE 
onto the stack pushes B into C, and A into B, berore mo MA 
A. Storing a value from A, pops B info A and C in: 

The instruction set has been designed for simple and 
efficient compilation OG high-level languages. A 
instructions have the same format, designed to given 
compact representation of the operations occurring mss 
frequently in programs. Each instruction consists cu 


- 


single byte divided into two 4-bit field 
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dr Se uga)ficant bits of the byte are a 
function code and the four least significant bits are the 


data value, as shown in Figure 2.3 [INMOSD86]. 


7 4 3 0 
Y 


| Cperand neg:ster | | 





ET Transpuuer instruction Format 


2. Processes and Concurrency 


Pipes SS res performs a number Of actions, and 
then either stops without completing or terminates complete. 

A  transputer can run several processes in parallel 
(concurrently). Processes may be assigned either high or low 
priority, and there may be any number of each. 

ne oaecessor Hassan creed scheduler which 
enables any number of concurrent processes to be executed 
together, sharing the processor time. This removes the need 


of a software kernel. 


ES 


At any time a concurrent process can be in one of the 
following states: 


Active -- Being executed 
-- On a list waiting to be executed 


Inactive -- Ready to input 
-- Ready to output 
-- Waiting for a specified period of time 

The scheduler operates in such a way that inactive 
processes do not consume any processor time. It allocates a 
portion of the processor's time to each processor. The 
active processes waiting to be executed are held in two 
linked lists of process workspaces, one for the low priority 
processes and one for the high priority processes. Each 
process runs until completion but is descheduled while 
waiting for communication from another process. In order for 
several processes to operate in parallel, a low priority 
process is only allowed to run for a maximum of two @eme 
slices (800 microseconds), before € is forcipis 
descheduled. 

The IMS T414 supports two levels of priority. The 
priority 1 (low priority) processes are executed whenever 
there are no active priority 0 (high priority) processes. 
High priority processes are expected to execute for a short 
time. If one or more high priority processes are able to 
proceed, then one is selected and runs until it has to wait 
for communication, a timer input, or until it compile 


processing. If no process at high priority is able to 
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proceed, but one or more processes at low priority are able 
Lo proceed, then one is selected. 

Low priority processes are periodically timesliced to 
provide an even distribution of processor time between 
computationally intensive tasks [INMOSD86]. 

3. Communications 

Communication between processes is achieved by means 
of channels. The process communication is pO Eco point, 
unbuffered and synchronized. As a result, a channel needs no 
process queue, no message queue and no message buffer. 

A channel between two processes executing on the same 
transputer is implemented by a single shared word in memory: 
a channel between processes executing on different 
Transputers is implemented by point to point links. The 
processor provides a number of operations to support message 
passing, the most important being input message and output 
message. The input message and the output message use the 
address of the channel to determine whether the chamel is 
internal or external. Thus the same instruction sequence can 
be used for both, allowing a process to be written and 
compiled without knowledge of where its channels are 
connected. The communications between two processes is 
established as follows: The process which is first ready 


must wait for the second one to be ready. 


T 7 


To be precise, a message is transmitted as a sequence 
of single byte communications; each byte is transmitted as a 
start bit followed by a one bit followed by the eight data 
bits followed by a stop bit. After transmitting a data bas 
the sender waits blote LIE ED acknowledge is received; this 
consists ok a sr Ae followed by a zero bit Me 
acknowledge signifies both that a process was able to 
receive the data byte, and that the receiving link is able 
to receive another byte. 

4, Timers 

The Transputer has two 32-bit timer clocks which 
"tick" periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a 
specific time. Also they are an excellent tool for 
programmers to use to evaluate the performance of networks 
and Communication timing! 

Two types of timers exist: one for high priority 
processes and one for low priority processes. The high 
priority timer is only accessible to high priority processes 
and is incremented every microsecond, having a full period 
of about 71 minutes. The low priority timer 15 mu 
accessible to low priority processes and is incremented 
every 64 microseconds, and has a full period of about 76 


Ou rs. 
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5. Memory 


The 2K bytes of static RAM provide a maximum data 
tate of 80 MBytes/sec with access for both the processor and 
HEDE S. 

The Transputer can also access 4 Gbytes of external 
memory space. Internal and external memory are part of the 
Sane linear acessa space ELE NEO 1S7 byte 
addressed, with words aligned on four-byte boundaries. The 
least significant byte of a word is the lowest addressed 
byte. 

The bits in a byte are numbered O to 7, with bit O 
the least significant. In general, wherever a value is 
treated as a number of component values, the components are 
numbered in order of increasing numerical significance, with 
the least significant component numbered 0. 

The internal memory starts at #80000000 and extends 
to #BOOOO7FF. User memory begins a 3800000048 and is 
referred to as MemStart. 

The reserved area is to implement link and event 
channels. Figure 2.4 L[INMOSD86], on next page shows the 


memory map of a T414. 
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Figure aa Memory Map 


6. External Memory incez c6 ce NE mes 


The External Memory Interface allows access to a 32- 
bit address space (4 Gbytes), Supporting dynamic ane saw 
RAM as well as ROM and EPROM. EMI timing can be configured 
at Reset to cater to most memory types and speeds, and a 
program is supplied with the Transputer Development System 
Lo aud in this configuration. There are 13 = 


configurations which can be selected by a single am 
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connection. If none are suitable, the user can configure the 
interface to specific reguirements. 

EventReq and EventAck provide an asynchronous 
handshake interface between an external event and an 
internal process. When an external event takes EventReq 
high, the external event channel (additional to the external 
link channels) is made ready to communicate with a process. 
When both the event channel and the process are ready, the 
processor takes EventAck high and the process, if waiting, 
is scheduled.  EventAck is removed after EventReq goes low. 

Only one process may use the event channel at any 
given time. If no process requires an event to occur, 
EventAck will never be taken high. 

1 Links 

The T414 uses a DMA block transfer mechanism to 
transfer messages between memory and another Transputer 
product via the INMOS links. The link interfaces and the 
Proeessor all operate concurrently, allowing processing to 
continue while data is being transferred on all of the 
links. The four links are identical, bi-directional serial 
ERG provide synchronization GE Communication between 
gaecessors and with the outside world. Each link comprises 
an input channel and an output channel. A link between two 
Transputers is implemented by connecting a link interface on 


Euw-ocransputer to a link interface in the other transputer. 
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Every byte of data sent on a link is acknowledged on 
the input of the same link, thus each signal carries Dema 
data and control information. Figure 2.5 shows ERR 


Transputer links. 





800028H 
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Transputers Links Memory Locations 


Figure 2.5 The Transpuiers Mini 


B. System Services 
The System Services include all the necessary le 
to initialize and sustain operation of the Transputer a 
also include error handling and analysis facilities sau 
are: Power, CapPlus, CapMinus, ClockIn, Reset, EGcor 


and Poke, Analyse, and Error. 
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B. THE BOO4 IBM PC ADD-IN BOARD 
The BOO4 Transputer board was used THI STE Ene 
n TIE of hold the I/O handler of the transputer network. 
It is depicted in the following lines. 
ME N EE ou CHEN ES for ene PC Add-in Board 
There are three main elements required for the PC 
board, and those are: 
a. A Transputer, with some external RAM 
b. The interface to the Personal Computer 


c. User controlled devices to allow the board to be 
used to control other similar boards 


Let's talk about the transputer and memory first. The 
T414 Transputer is a 32-bit processor with a processing 
ec pability of 10 MIPS. 

For the personal computer add-in board, it was 
decided to give the user up to 2MBytes external RAM, mapped 
ro the internal RAM of the T414. For this amount of RAM on 
an IBM form-factor board, dynamic RAM (DRAM) had to be used. 
Also, a parity check system was implemented. 

The communication with the host Personal Computer is 
handled using the C002 Link Adaptor; this device converts 
serial link data into byte-wide parallel data, and vice 
Em c The C002 allows simple interfacing with standard bus 
EGuPcectures, appearing to the host computer as a memory 
mapped peripheral. 

A number of system control signals are also provided 
which give the user the possibility of connecting a number 
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of Transputer boards to the add-in board via ss I 


allowing the add-in board to control a Transputer network. 
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Figure 2.6 Block Diagram of af 


Because of the Transputer programmable memory 
interface, we can configure the external memory cycle cy 
Lranspurer co be any width to suit slow and fast memory. 

Also a number of strobes were supplied which can be 
programmed to give refresh signals to DRAM (antem 
refresh over a selectable refresh cycle time can also DE 
chosen). This eliminates the need of timing generators 


The interface with the persona! sem ve E possible due Es 
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the communication between the PC parallel bus and the 
Transputer via one of the Transputer serial links. 

This method was chosen because it maps into the 
K Spuler concept of communications via OCCAM channels, 
i.e., the host computer appears to be as a process at the 
end of a channel mapped into one Transputer link. However, 
that also implies that the Transputer only use a channel to 
communicate with the host computer. 

To make Chas SOLE oiNENMnterrdce possible, were 
developed devices which convert parallel data into serial 
data, and vice versa to match with the channel protocol of 
the Transputer links. 

The aim of the system control functions is to 
initialize, and analyse errors in an arbitrarily large 
network of Transputers built with many boards. In particular 
a B004 board must be able to control many other boards in a 
rack such as in the EUROCARD BOX. 

C. THE BOO3 BOARD 

The IMS B003 evaluation board was the main unit used to 
build the prototype of the 16-transputer network developed 
En this thesis. 

It comprises four IMS T414 Transputers with 256 Kbytes 
of DRAM in each Transputers. The links provided with the 
evaluation board allow the Transputer network to be easily 


extended by connecting with other boards. 
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This board is capable of processing up to ao Ha ma 
data rate of its links is either 10 cr 20o M s 


The four  Transputers are connected in a ring as Sino a 


TIE 2 le 
link 1 link O 
| 
link O ink 2 link 3 link 1 
1 
link 3 link 2 
link 2 link 3 
link 1 ink 3 link 2 link O 
3 2 
link O link 1 
Figure 2.7 The B003 boere 
There are two links per Transputer which can Ba 


connected externally. Thus each B003 can be connected to 


four neighbor evaluation So rc 
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PITCEDESNMSN AND EVOLUTION 


CHE MODELING PROCESS 
Description of the Problem 

The problem chosen was the heat flow problem in a two 
dimensional plate and how this problem could be solved using 
globally distributed variables in a transputer network. 

This problem was selected because ME ds 
representative of many similar types of problems that arise 
in meteorology, science and engineering. 

The heat flow problem in a two dimensional plate is 


governed by the partial differential equation: 
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“EN specified initial and boundary conditions. 

To find the steady-state temperature distribution in 
the square plate, one side is maintained at some temperature 
which is called the hot end temperature, and the other three 
sides are maintained at 0 degrees (iced bath) as shown in 


ESSore 3.1. 
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Figure 3.1 Heat Conduction in a Square Place 


All internal points on the grid start also ra 
degrees. Also another element which is present in this 


equation is the propagation rate W, which is equal to 


At 
(l- 4r)/r where r= > 
AO 


The method of solution is to iterate through al? ar 





posmuss calculating a better approximation to ENS 


temperature at each point (i,j) in turn using the egu NU 


Tap Wt ajay Os n ME RS 


See (drw) 
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AS soon as a new value of T is calculated at a point, 
ME Levi valve is discarded. This is the Gauss-Seidel 
method of iteration. To start a temperature of 0 degrees is 
assumed everywhere within the plate. This Bec SEC 
Ms ac ion is repeated through all grid points until further 
iteration wculd produces, very little change and eventually 
no change in the computed temperatures. At this moment we 
have reached the steady-state solution, and we can assert 
mS this is the moment at which the iteration converges, 


by which we mean, if 


lim T(ij) (tm + 1) = Tj) 


tm -—--» c 


then our equation satisfies the discretized version of the 
Laplace's equation. 

Our finite difference scheme involves five points, 
four at time tm and one at the advance time tm + 1= tm + Dt, 
Mat allows us to "march forward in time". In this numerical 
scheme, the temperature at the next time is the average of 
the four neighboring mesh points at the present time, 
adjusted by the propagation rate W (relaxation parameter) 
Dueh is a function of the thermal conductivity coefficient 


of the material. 
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2. The Abstract model 
Our abstract model was defined without using a £0 WM 
specification approach. It can be seen asa black ia 
which a function operates ruled by the partial ditftrtern mum 
equation described above. The box provides the soo sy 
the steady state distribution of temperature in a Sam 
plate, with hot end temperature and propagation rate INP» 


as shown in Figure 3.2. 











Partial differential 


equation 









boundary 


conditions Solution heat flow 










T/ 27 a'r 
Voe zr 2 


dy problem in a two 


dimensional plate 


Figure 3.2 Abstract Model 


3. The Transformed Computational Medel 
The Transformed Computational model repre Cm i 
adaptation of the mathematical model to tne facil 
supported by the OCCAM programing languages in a modular 


fashion. This model is shown in the Figure 3.3. 
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Figure 3.3 The Transformed Computational Model 


At the bottom of Figure 3.3 we observe the processes 
executing. On the left side is located the I/O Handler which 
Non charge of supply to the Main Procedure with the 
Eoundarv conditions necessary for the correct operation 
during each new iteration. On the right side is the Main 
Procedure box which contains two pasic Blocks: Mme 
Eemmunication Block and The Calculations-Updating Block. 
The Communication Block is in charge of the maintenance of 
the interchange of messages with the I/O Handler and 


eventually with other neighbor Main Procedures. 


END 


The Calculations-Updating Block has the functions of 
calculating the new temperatures for time tm + 1 and also 


updating the values in the mesh points. 


B. NETWORK MODELS AND EVOLUTION 
1. Network Classification 

We can categorize our network prototype as a MIMD 
Transputer network, because we have interactions among the n 
Transputers which comprise the network, due to the fact that 
all memories streams are derived from the same data space 
virtually shared Dy sali Transputers. Also this MIMD 
transputer network is a loosely coupled one, because of the 
facilities created by the OCCAM programing language. 
In particular the input and output messages which use the 
address of a channel can determine whether an internal or 
external channel, is being used. Thus the very same 
instruction allows a process to be written and compiled 
without having knowledge of where its channels are 
connected. That is a Transputer does not need to have 
knowledge about its neighbors to operate properly. 
Our final stage will consist of a Transputer network of 16 
Transputers connected and operating in parallel to solve the 
proposed problem of the heat flow in a square plate. 

The type of arrangement chosen was a Torus Double 


Transitive Closure as can be seen on the Figure 3.8. 
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This type of network is also known as Regular Network 

[CAWE80] and its main characteristic are the following: 
ce The “tree” is a hierarchical structured variation with 
any processor able to communicate with its superior and 


EG ed nate as well as its two neighbers. 


b. If one of the Transputer fails we have redundant paths 
for single connected failure. 


sa vie cost or this network is relativelv nıch if we 
considered its computational power. 


eee cde medularity and expandability is poor. 


SEE iem ase is very high typically 3 to 5 MIPS, but 
using the Transputer, we can have higher performance. 


2. Model Evolution 

Initially we made the set up for one Transputer , but 
in order to compare the efficiency with a Transputer 
network, the model was expanded to an array of 2 X 2 , an 
ay of 3 X 3 and the final stage was a 4 X 4 Transputer 
network. 

First let's see the different models which were 
considered, why they were discarded, and why we choose our 
final prototype model. The Model I, was a system in which 
aie processes A, B, C, D, E, and F simulated the boundary 
Eeuditions and the numbered processes achieves the 
calculations to solve the problem. This model was discarded 
because for each line of Transputers, it had two Transputers 
doing nothing but serving to convey the boundary conditions 


“do extract the final solution of the problem. 


a5 


Also the communication verticais was very 


inefficient. The Figure 3.4 depict the model. 
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Figure j.4 "Models 


The Model II has the processes A, B, C, ana 
senders/receivers of boundary condicions. The main 
disadvantage of this model is that as we increase size of 
the network, we will need more Transputers to handle the I/O 
and boundary conditions passing, this model works Well 
a small number of Transputers, assuming one isinsi T 
use four Transputers to handle nothing but boundary 


conditions. This model is shown in Figure MoE 


34 





| 

T bb 
ng 

| 

| Ms 


Ms 


Figure 3.5 Model II 


Finally the model we selected, shown in Figure 3.6, 
is one which handles the boundary conditions better. We use 
one B003 for the one Transputer network, and make the other 
three Transputers transparent. The 2 X 2 network used all 
four tranputers in the board. For the 3 X 3 network we used 
four B003 using the same idea as for one Transputer in one 


B003 board, but now making transparent seven Transputers. 
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01 transputer network 04 transputer network 





09 trsnputer network 16 transputer network 





Figure 3.6 Model III in Its Different Sizes 


IS interesting to see how the flow of data is 
achieved in this model. Figure 3.7 shows how the boundary 
conditions and the start/stop signal are propagated through 
the network, as well as the data path follow by the 
solution, when it is sent back to the handler to be 


displayed on the screen. 
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In general this model was chosen because it provides 
the larger Transputer device Utilization without have 
widle or misemployed Transputers on the four and sixteen 
Transputer networks, and also because its symmetry permits a 


more even distribution of the communication load in the 


network. 
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Pigure Ddta Flow in the Network 
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3. The 16 Transbuker DEO O ce 
On Figure 3.8 we can observe the 16 transputer network 
ver its connected links, including these which 
communicates to the I/0 handler. The programs for each one 
of the transputer networks are contained in the Appendixes 
A, B, C, and D; the implementation of the modules are those 
programs, and they will be discussed in the next chapterzee 


the paragraph, Maximization of Software Performance. 





Figure 3.8 16 Transputer Network Prototype 
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wee pandability of the Model 


This transputer network can be expanded easily using 
the series, (2 + n)^2 A O O Bi a 
ms allows the cons rer oe rens peer networks 
E HET all the Pransputers available on the B003 boards 
Dueh os not the case if we get n= odd, then the Transputers 
that are left over must be made transparent in order to run 
the network. This practice however, makes the placement of 
the channels a job tedious and error prone. 

Appendix E is contains an expandable placement of 

Transputer channels following the above series for n even. 
Thus we can easily place with just change a number, networks 
EMO 36, 64, 100 ..... TÉransputers [TNMOSTNLES]. 
The way in which the external links were connected, 
including the links that joined the different B003 boards on 
the EUROCARD box, is displayed on Figure 3.9 for 01 and 04 
AE nsputer networks, and in Figure 3.10 for 09 and 16 
Transputer networks. 

The connecting box(es) shows the connections between 
puec various B003 boards which make up the Transputer 


network. 
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to link BOO4 


To set up the external links, Just 
match up the numbers using the 
twisted cable, provided with the 
boards. 


reset line 





B004 
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EUROCARD Box 


Figure 3.9 01 and 04 Transputer Networks Connections 
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Figure 3.10 09 and 16 Transputer Networks Connections 


40 





IV. EFFICIENCY CONSIDERATIONS 


A. INTRODUCTION 
1. Generalities 

Two related aspects of a parallel computer that 
affect run-time efficiency are the speed of computation and 
the speed of communication. The first relates to the design 
of the processor, its instruction set, and its organization 
(such as the use of a cache and pipelining) and its 
realization (such as the speed of its transistors). The 
second relates GO the interconnection network, the 
scheduling of its resources and the routing of information 
through it. This second aspect is less understood, and is 
the one in which different paradigms of parallel computers 
differ most. We focus on these two aspects, and propose that 
en. application be characterized by its communication 
requirements. Applications with similar communications 
requirements can be grouped together. For instance a pattern 
a cognition edge-detection problem can also be put in a 
Transputer network mesh structure and our two dimensional 
heat problem also can be put in a Transputer network mesh 
aanu cure. These two mesh structure problems have radically 
different computational requirements, but have the same 
communication requirements. We can study such network 
topology from the point of view of how well it handles a 
related class of Transputer network procedures. 


Al 


We agree that a large Transputer network can be built 
to solve large problems, then we will submodel that problem 
into two models; one model considers how a very large 
Transputer network can be built, and how a MIMD Transputer 
network process can be expanded within it, to determine 
whether doubling the number of Transputers assigned to that 
problem will speed up its execution by a factor of tea 
might get linear speedup if that were true. (This ideal 
situation is not easy to achieve, unfortunately). Those 
results about linear speedup OF Transputer network 
procedures are very important since we need good procedures 
for Transputer networks. The other model which is complement 
of the first assumes that the problem size will remain fixed 
and the machine will be larger and larger inductively. That 
is, the problem may be run on one Transputer, and the 
machine might be expanded from one to sixteen Transputers, 
and we will consider the efficiency of running the problem 
on the same one Transputer. 

This model is easier to study, since rather simple 
and general statements can be made on it. 

It is quite useful in understanding the overall 
model, since expanding a Transputer network system to solve 
a bigger problem can be done by fixing the problem and 
expanding the machine first, then expanding the problem to 


fill the machine. 
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In this thesis we will devote correspondingly more 
time to studying the model which shows how a given 
Transputer network process can be expanded within a large 
machine to determine if increasing the number of processors 
assigned, we can get linear speedup, also some reference and 
results related to how the fixed sized problem  behaves when 
the Transputer network system in which it runs is expanded 
Eneuctively [LIMIST7]. 

EE Ucrminology andGOEcepts 

We want a suitable set of definitions to evaluate the 
quality of our architecture. Because of that, a notion of 
"energy" is given besides the traditional concepts used in 
engineering for the efficiency study. 

a. Power and Energy 

The computational energy for a process is the 
maeseuct"Or the computational power (bit rate able to be 
generated by the hardware of the 'Transputer) and the time 
the hardware is needed, where the computational power 
includes all the output necessary to run the processes and 
Bae time is the product of the length of the clock cycle or 
in other words is the time required for computation and 
eemmunication. 

To clarify those concepts let's see an example, 
suppose we have a network with four Transputers like the 
case of the networks that can be implemented using a IMS 


BOO3, then if each Transputer has a computational power of 
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10 MIPS we can then assert that our network comprised of 
four Transputers will have a computational power of 40 MIPS. 
Therefore if we have a module running a process, we can use 
N identical modules to execute the same process (as N 
Transputers) but having available N times the computational 
power of one module. 

When we expand our  Transputer network in an 
inductive way, we call each Transputer that we add a unit of 
computational power (UCPs). 

In the evolution of our 16 Transputer network 
prototype, we pass through the 3 X 3 network which is 
assembled using 4 boards B003, then in this topology we find 
a special kind of  Transputer which is transparent sai 
neutral unit. It does not compute and only has the tas Ww 
moving data in and out of the network or simply devan 
nothing as the Transputer located at the right-lower cor m 
These modules cannot be classified as UCPs, so we call them 
blocked UCPs, and these will be considered when we evaluate 
the Transputer network in the next chapter. We also take 
these into account when we measure the total amount of 
computational energy necessary to run a Transputer network 
process. The Time is also an interesting concept, and it 
includes all the components of the time needed to execute a 
Transputer network process. We will break the time in two 
main blocks; the communication and the calculation Ew 


These two blocks are very well defined in our Transformed 
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Computational Model from Chapter III and also they can be 
seen on any of the programs from the Appendices. 
b. Efficiency 

The usefulness ofa computer is indicated by the 
efficiency it exhibits in the execution of processes on it. 
This is the obvious definition for efficiency; now we will 
define relative efficiency as well as the concept of 
equivalent process necessary to understand the relative 
efficiency. Later a relation between relative efficiency and 
input computational energies will be stated. 
The relative efficiency of two computer systems executing 
equivalent processes is defined as the ratio of the 
efficiencies of the two systems in executing the process, 
where two processes are equivalent if they provide the same 
outputs when given the same inputs, (which is clearly our 
case in the network). Therefore we can define the efficiency 
of a computer system in executing a process as the ratio of 
input computational energy (ability to generate bits from 
the modules) to the output computational energy (information 
of theoretic bits produced by a module). 

From this definition we can state that 
the relative efficiency of two computer systems executing 
equivalent processes is inversely proportional to the ratio 
of input computational energies of the two computers 


[LIMI87]. 
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3. Applications of Eftficyenc EDEN IE 
So far the reader probably has some doubts about the 


concept of efficiency and that it is critical fonw 
analysis that we present in the next chapter. Thus tol ma 
some light, let's use it to analyze some issues to show its 
veil vg 

First of all, we will consider the simple 1d ms 
serial-parallel conversion, which leads to the notion of 
speedup. Before we do that we will classify the erin 
analysis in two types: first order analysis which ignores 
communication and control, focuses on computation, ar 
second-order analysis which considers all these factors. 
Then the analysis that we use to determine if a procedure is 
capable of linear speedup, may be a first-order analysis and 
to understand the real world we will need to apply a second- 
order analysis. In Figure 4.1 we can observe the classical 
comparison between parallel and pipelined processors, this 
is a simple notion which has been manipulated by theorists 


for many years. 
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Figure 4.1 Energy for a Serial and Parallel Adder 


If we examine the relative efficiencies of a serial 
and a parallel adder (Fig. 4.1), in which the computational 
power of the adder cell is much more greater than that of 
mee control and communication circuitry that support the 
adders (i.e., the calculations are more time consuming than 
the communications), therefore we will ignore these factors 
(first-order analysis). The energy for a 3 bit serial adder 
id tor a 3 Dit parallel adder is shown in the Figure, in 
the serial adder we have one unit of hardware used for three 
units of times and in the parallel adder we have three units 
Of hardware being used for one unit of time, then clearly 
the areas are the same and so are the relative efficiencies. 
This simple procedure shows the notion of linear speedup. If 
the number of UCPs is multiplied by N then the time to 
execute the procedure is reduced to 1/ Nth, or the speed is 


increased by N. 
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It should be noted that linear speedup is equivalent 
to constant computational energy. This, type of analysis 
will be used extensively in the next chapter, when we 
perform the compo r t ve evaluaticn of the differs 
networks. The results are misleading in some architecte 
because it does not consider the changes in computational 


energy due to the communication and control. Nevertheless 


Fh 


the analysis carried out on tbe different prototypes was o 
the type second-order, because the communication time was 
include in the total time. 

From the notion of linear speedup and conversion 

serial to parallel we can realize about the secondary 
importance of the speed as figure of merit in a topology. 
A parameterized architecture based on a single procedure as 
addition is capable of considerable speedup. For instance 
12-bit add can be done one bit at a time in 12 time steps, 
or 12 bits at a time in one time step. Within limits, Iy 
possible to squeeze the time dimension of an energy area as 
the power dimension is increased to get constant area. 

The degree to which parallelism can be exploited to 
get speed depends on the amount of data to be processed! 
However the limit to the speedup is given by the smallest 
size of the unit of computational power (i.e., indivis'b' 
and this is the fundamental idea why the researchers are 
interested in fine AF am rather than large grain 


parallelism. 
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ukene Oie EA et go further than a Turing 
machine. Within these limits, time can be traded against 
power. Thus the speedup is not a fundamental figure of merit 
for a parallel architecture. The more fundamental figure of 


merit in a parallel architecture is the efficiency. 


B. MAXIMIZATION OF THE TRANSPUTER NETWORK 
1. Generalities 
This section will describe how to obtain better 
performance from a Transputer network (array type). However 
only very general guidelines can be given, because this area 
is still on active research and our solutions tend to be 
specific to our problem. 
2. Maximizing link performance 
The Transputer link is an autonomous DMA engine 
capable of sustaining a bi-directional data rate of 20 
Mbits/sec. However in our prototype we are using 10 Mbit/sec 
as the common data rate. The higher rates can be used 
without seriously degrading the performance of the 
processors. To achieve a maximum link throughput the system 


links and the processor must be kept as busy as possible. 
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Following are some suggestions for achieving the 
maximum chrouchpuee 
a. Decoupling communication and computation 
To avoid the links waiting for the processes 
vice versa, link communication should be decoupled from 
computation. For example, it is inelficient eo tt code 
like the following 
SEG 
JI dara 
compute(data) 
oUt ara 
because we are forcing the Transputer to perform one action 
at a time, as inputting, computing, and oucpúucl ir a 
solution is doing the three things at the same time using a 
couple of buffers into a parallel constatei 
PAR 
butter( Ja a) 
compute (a tm) 
buffers os), 
b. Gather together all the communications processes 
This can be seen in the communication blocks of 
the diverse designed prototypes. The communication process 
must also be wrapped into a PAR construct. If possible ma 
also recommended to put this PAR package inside a PRI PAR 
running first or at high priority, the communis e 


package. 
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c. Large link Transfers 

when we set up a transfer down a link, the set up 
itself takes about 1 microsecond. Once the transfer is 
initiated, it will proceed autonomously from the processor, 
Pen umIing typically 4 processor cycles every 4 microseconds. 
Thus the idea is to keep the message as long as possible. 
However, long data transfers also increase latency when data 
must be transferred, which occurred in our case for the 16 
Transputer network prototype. To solve the problem we used 
the optimal message length in all the topologies developed, 
including the final model of 16 Transputers, which used 
between 10 and 100 bytes [SIHA88]. 

d. How the boundaries were passed in the network 

The problem of the boundaries exchange was 
approached in the following manner: The basic idea was to 
send and receive by all the channels available, and if the 
information (boundary) was not necessary, we just do not use 
it. It may appear inefficient but for purposes of creating 
homogeneous processes, we favored this option. This gives a 
uniform communications package, allowing a better measure of 
the performance to be obtained. The boundaries were one 
dimensional linear arrays with a maximum length of 24 
integers (One Transputer network) and a minimum length of 6 


integers. 


Sl 


Figure 4.2 shows how this sequence of evento mar me 
Once the communications are achieved the different 
boundaries are stored in linear arrays called dummies, then 


the processes decide whether to use them or not. 





| Dummy arrays 
boundary conditions 
arrives before 


they are used or 
discarded 





out 








boundary 


in 


Figure 4.2 Boundary Exchange 
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MOD UEARTIY OF TRE SYSTEM 

The modularity of this type of system is poor [CAWE80]. 
The main issues that conspire again the modularity of each 
of tne procedures were the routing code for the start/stop 
i and che routing code to extract the final information 
cm tne network. Even considering this little difference in 
the implementation of each module, we still preserve the 
E urmuctures tor the Communication and Calculation Block 
identical. We call these two blocks the main data structure, 
which allows us to see the Transputer network as a system 
with virtual shared memory by duplicating the information in 
Bach main data structure which is in turn a block of memory 
on each Transputer. 

The routing codes are different, however because most 
of the Transputers in the network have to perform a 
Ss erent Joo to assure the transmission of the start/stop 
Signal and to flush the results out of the network. 
For instance Transputer number 0 which is at the upper left 
corner, has to receive and send to the I/O Handler 15 arrays 
of temperatures plus S Gma array m in contrast to 
Transputer number 3 which is at the lower left corner, and 
which only has to send up its own array the moment after the 


reception of the stop signal. 
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V. COMPARISON OF NETWORK PERFORMANCE 


The main reason to build parallel computers is eeu 
able to solve larger problems or to solve the problem 
faster. 

This chapter focuses on the central theme of IR 
thesis. We have described so far a parallel computer 
(Transputer network) prototype which has been implemented in 
an inductive fashion. Briefly, an inductive architecture is 
one that can execute a number of jobs proportional to the 
number of processors, and the energy needed for each job is 
proportional to a sublinear function of the total number. 
processors. Thus a relatively large process, as the one used 
in this thesis (heat flow problem) whose procedure exhibits 
linear or nearly linear speedup, can run efficiently “enan 
whole network if it has an inductive architectus 
It is also convenient to comment that the experimental 
results obtained from the different Transputer networks were 
conducted using off-chip memory data. This provides 
worst case evaluation and all the results are under the same 
general conditions. 

A. ARE WE USING AN INDUCTIVE ARCHITECTURE ? 
After the above lines and before get into the efficiency 


subject, we think it is good to verify his pm 
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PAG Nut architecture is indwetive if: 


ccc is a basis architecture, and all architectures 
O NL components Ene. are units of the basis. 
For us that is certainly true, since the basis 
architecture is represented by only one Transputer, and 
the other architectures contain nothing but the same 
Ier wINeHWSUEDSCTUTTSaBSDUter.: 


There Es an” inducticn mechanism that can expand an 
rel tes Ure iremos CO N+l UCPs. That also can te 


seen in PA AC me See che basis 
architecture on the ert and the expanded architeczure 
sesong TOL a Simple 2v AN mesh. The inéuczien 
NEehcmesmesoolv "deus" Transteers around the perimeter 
of the mesh to increase the number of UCPs from N^2 to 
(Nr 1) 22. 








O1 TRANSPUTER 04 TRANSPUTERS 








09 TRANSPUTERS 16 TRANSPUTERS 





Figure 5.1 The Inductive Mesh Architecture 


Therefore we can assert that our expanded model is an 


C Live architecture. 
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B. EFFICIENCY EVALUATION 

In general the efficiency can be increased by using a 
better procedure, a faster technology or processor. In this 
thesis, using the inductive property of our architecture, we 
will not change the technology, procedure or processor, but 
we will use a variable number of identical processors 
(Transputers). What we have done in this evolution, or 
better, induction of the basis model is to fix the size of 
the problem. That is, we are solving an array of 24 by 24 
elements and executing it on more UCPs or Transputers; our 
goal is to show how this  Transputer network process runs, 
without seriously decreasing in efficiency. Then from our 
experimental results we can see in Figure 5.2 a picture, 
which is pretty much the same as the one used to describe 
the linear speedup concept in Chapter IV; the sizes of the 
UCPs differ a bit from the original basis, but this is“ 
to the fact that we are using a second-order analysis in 
which the communication and control overhead is considered, 
and of course larger than for only one processor running the 
same process. In this Figure on the left, the area of the 
rectangle is the energy to execute the process in one 
Transputer, and on the right and the bottom we can observe 


the same for the other inductive architectures. 
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In Table 5.1 we have a summary of the results. 


We can 


observe that the computational power of each network 


famine rementead as expected by factor of 4, 9, and 16 in 


relation to the value of the network of one Transputer. 


time 


30082 


05.74 


02.33 


01.08 


Sec 


Sec 


Sec 


Sec 


TABLE 5.1 


PROLOLYPE ENERGY “RESULTS 


computational power # of Transputers 
310,519 bit/sec 01 
1,400,102 bit/see 04 
2,829,484 bit/sec 09 
4,949,449 bit/sec 16 
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Ol transputer computational energy C4 transputer computationa. energy 
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Figure 5.2 Efficiency om r m 


The values for four and nine Transputers are, as 
explained before, a little bit above the expected because 
the communication and control overhead, but in them Ra 
Transputer architecture we see that now the computational 
power to run the process is a little bit less than the 
theoretical calculated value, which will be 4968304 bit/sec 


(310519 x 16). 
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— aaa SEE» DD weal 


The reason for that is referred to Chapter IV, on the 


EE ap “the applications of efficiency analysis", in this 


EE 


Case our architecture is entering in the fine granularit, 
zone so the degree at which the parallelism is being 
exploited is superior to the two former cases; also we can 
EE Or Our inquetcive model” the atomic size of the UCP 
# Tor an array of DE E AA SAKS, |n malchPene array 
of temperatures we are deal with is only 3 4 X 4 elements. 
Bevond this point we cannot continue diminishing the size 
because the Transputer process simply does not work. 

From Chapter IV we remember the definition of 
efficiency; it was the ratio of input computational energy 
to output computational energy; and also we should realize 
that the efficiency factor is very low because we have the 
Erz put information of the process divided by the information 
delivered by the hardware modules in the time necessary to 
solve the problem (i.e., time to steady state in our case). 
In Table 5.2 we can see how the efficiency is improved in 
relation to the network basis of one Transputer. For this 
calculation we recall that the input computational energy of 
the system is equal to the Time times the computational 
power, and the output computational energy is equal to the 
maximum data rate for the Transputer which is 1024 x 1075 


bits/sec [INMOSO86], times the Time. 
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TABS 


EFFICIENCY COMPARISON FOR THE NETWORKS 





in.cp.energy out.cp.energy effi. ratio # Transpe 
5255765000 2570195258 0.0030 01 
587776000 80365852. OO key 04 
238592000 6592609. 72 0.02 6 09 
110592000 5345404.92 0.0483 16 


AS can be expected as long as we are entering on the 
fine granularity zone, the efficiency of the system is 


improved. 


C. “RELATIVE EERICIENCY 

Another measure that we performed is the relative 
efficiency of running our Transputer network procedure in 
the different systems. 

From the definition we know that the relative efficiency 
of two computer systems is the ratio of the efficiencies of 
the two systems executing the same process. This results are 
resume in the Table 5.3, on which we take the higher 


efficiency as base to compare the others again it. 
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PASTE 5,3 


EKATE VESEFETCTENCY 


basis efficiency = 0.0483 


| relative efficiency for 01 Transp. network = 6.21% 

| 

| relative efficiency for 04 Transp. network = 28.36 % 
ED. MA 35 


relative efficiency for 09 Transp. networx 
| 
| 


On this Table we can realize SE the efficiency of the 
one Transputer network, is about 6.21* the efficiency of the 
sixteen Transputers network, and so on for the others 
networks. 

Mil relative efficiency is plot in Figure 5.3. We 
observe a plot of the efficiencies, related to the highest 


efficiency presented by the sixteen Transputers network. 
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D. TRADITIONAL APPROACH TO SPEELUP ES ier, 

The speedup that our system is capable of achieving can 
be graphically determined using the traditional method ias 
is outlined now. We know from before that if we have a 
parallel computer with N equivalent processors running in 
parallel on a problem, it will be N times faster than a 
single processor running the same process. Certainly this is 
the ideal case, but in the reality the speedup of a system 
ranges from a lower-bound of lg(N) to an upper-bound of 
N/ln(N) [KAFA84]. The lower bound is known as Minsky's 
conjecture. Using this conjecture MEE WC C only sl a 
speedup of 2 to 4 from our four and sixteen Transp cem 
networks. In the other case we have a better estimate of N/ 
ln(N). For the latter case let's get through the estimation 
and subsequently plotting process. We can say that the 
process at the one Transputer network is running in al 
of time, Tl= 1. Let Fi be the probability of assigning 
same problem to i processors working equally with an average 
load Gies ye per processor. Furthermore assume equal 
probability of each operatinalmodel sig j: processors, 
that is Pis 1/N, for N operating modes : i= 1 O NM 
Then the average time required to solve the problem on an N- 
processor System is given below, where the summation 


represents N operating modes. 





The average speedup S is obtained as the ratio of Tl= 1 
o 4n: that is “Fr 2427. Then in che Figure 5.4 
we observe the plot of these upper and lower bound plus the 


ideal case and also we can see our result. 
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Figure 5.4 Various Estimates of Speedup and our 
Results 


In this plot we can observe, that as we enter in the 
fine granularity zone, due to the meduciion. ins the 
communications overhead and computational time, we are 
exploiting the parallelism in a more efficient fashion, and 


obtaining a better speedup. 
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B. SOME DETAILS 

There are some conditions about this evaluation and some 
observations that are necessary to explain and which can 
serve as hints for future investigation. 

First, during the evaluation of the different networks, 
there were automatic ways of setting up to evaluate. That 
is the processes were loaded on the Transputer network and 
when they were ready with the data, they stopped the 
processes themselves and displayed the information on the 
screen. Although this look like a fairly good way to save 
time, in our particular case, the method was discarded 
because it introduces an overhead in communications which 
would bias the accuracy of the measurements. 

Second, the programs were implemented using the Type 
INTEGER for all the arithmetic operations. It allows a 
program to run faster and also the comparison tim g 
establish the "steady state" condition was less than if we 
had used the Type Floating Point, which from the comparison 
resulted much more time consuming than the Integer Type, as 
expected from the OCCAM programming Language specifications. 

Third, once the programs were implemented, there were 
other paths of investigation, such as the one in which the 
problem size was augmented to run on a 4 X 4 Transputer 
network, giving an overall array of 96 by 96 elements. In 
this case the results showed an improvement; i.e., an 


increasing in throughput was observed. 
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ie reason is Empler but not Subtle; in this case the 
Hi 6 ement of the performance was due to the fact that the 
Danber of computation per unit of time was increasing by a 
square factor, while the overhead in communications grew in 
a linear fashion, therefore we were again diminishing the 
Size of the grain. This point will be discussed later in 


Bil chapter. 


F. COMPARATIVE THROUGHPUT 

The throughput is another type of performance measure 
that can be recorded. The throughput in our system 
represents the number of results per unit of time that our 


system can achieve. Table 5.4 gives us a summary of the 


pesults. 
TABLE 5.4 
THROUGHPUT RESULTS 

| 

array size # transp. throughput | 

24 x 24 01 40511 results/sec | 
2 1 04 206580 results/sec 
SS 09 392535 results/sec 

6 X 6 16 671824 results/sec | 





Also we can do a relative comparison between the 
efficiencies as we did before with the efficiencies 
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determined from the computational energy of the system, and 


certainly, as can be expected, 


same. The 


DE Sr 


these 


TABLE 55 


values are much the 


summary of this information is recorded on Table 


THROUGHPUT AND RELATIVE THROUGHPUT 


[ TO Too rm 


| array size # transp. rel. throughput 
| 24 X 24 01 6.30 % 
| jJ 2 04 50%! S206 
x 8 X 8 09 58. 20 
6 x 6 16 ---- (*x*) 


(*) the overall array size is the same 


GR. 


basis throughput 





G. “THE OPTIMAL ZONE 


We know that the idea of reducing the granularity in a 


parallel architecture is the main focus of the research 


today, but conversely there is a practical limit cng 


little computational power can be used to execute a process 


related to the cost of the hardware and the threshold time 


to execute the process. In other words, it appears to be 


ideal to break up the problem into smallest possible 


components For parallel execution, but Chan fine 


partitioning can in practice be too costly in terms s 
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overhead and cost of the hardware. For instance, we will be 
underusing a powerful microprocessor as the Transputer, to 
solve a very little problem product of this partition, and 
also when we partitioning a problem very finely, we get more 
time consumed to communicate data between Transputers, thus 
slowing down the production of results, and not gaining any 
improvement in performance. Therefore we have to find a way 
to balance the communication and computation in a effective 
manner. To that end, the answer is to get a more relatively 
coarse partitioning, i.e., get a tradeoff between the 
maximum number of processors that can be feasibly employed 
to solve the problem and the time constrains of the problem 
itself. The idea is to find what we have called the "optimal 
zone", and operate our machine in it in order to have 
maximum performance and consequently the best efficiency. 

In our sixteen Transputer network prototype, we have a 
system comprised by many small internal fast memory 
processing elements or Transputers, that communicate each 
other relatively fast through the splendid  Transputers 
links, thus this architecture lends itself to fine grained 
problems. These expectations were confirmed from our 
experimental results. Another way to approach the problem is 
to fix the number of processors and reduce the granularity 
by using a larger array. We use this method in the four 


network prototype and the sixteen network prototype. 
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We decreased the granularity using software, i.e., we 
increased the size of the array of temperatures and we 
observed and recorded the behavior in relation gU 
poU DUE. 

The testing ies Was performed on these two 
architectures was to run the programs, changing grant mwan 
Steering wich a EIE coarse grain, i.e., We LLLI 
temperature array of 4 elements and we incremented its sie 
up to 24 elements, and we were recording and calculating the 
different throughputs for each different problem size. Thus 
we could observe the throughput start to increase 
continuously from the minimum size, and then stabilize ae 
an array size of 14 x 14 elements, for an array ork i 
elements (not shown), this behavior still holds. It is true 
the throughput increased greatly, but on the other hand, the 
time to solve the problem also increased. Here we han 
tie our performance to timing constrains. From this wel ' 
deduce the existence of the optimal zone for this type of 
architecture. To illustrate these concepts we can sese 
Table 5.6 the results of throughput for different Gr 


sizes on both architectures, and in Figure 5.5 we can see a 


plot of these results. 
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TABELE 906 


THROUGHPUT RESULTS FOR DIFFERENT GRAIN SIZE 


array Ad Transp. MESI Ranson 

grein size eros in throughput in 
size elements results / sec results / sec 
extrem.large 04 62500 250000 

very large 06 DEE DI 444432 
large 08 je! 473680 
medium 1:2 136869 543472 
transition 14 145161 580640 

fine 16 ds AS 532608 

fine 18 140350 56] 92 


very fine 24 144047 576176 





throuphput in bits per second 
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Maximun troughput 
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Iu gure 5.5 


Throughput for Different Grain Size 
in a 04 and a 16 Transputer Network 
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H. HOW THE PARALLELISM WAS ACCOMPLISHED 

So far, we discussed throughput and speedup of the 
different Transputer networks and we have proved from the 
experimental results the existence of parallelo C T EEE 
Now let's consider the parallelism in more detail to have a 
clear idea Of what 2 os 

We must recall that we hav^ = Transputer network process 
running making use of the virtual sizred memory system. This 
virtual shared memory is obtained by duplicating information 
in local memories so that when a producing process writes a 
new value into its Local memory, the synchronous operating 
system generates a message which is broadcast to all 
consumers of the data via the point to point link mechanism 
of the Transputers. Thus the local memory of each com oi ea 
node (Transputer in the network) contains the duplicate data 
ready to be consumed by each consumer in its local memes 
[KOD88]. The reading and writing is accomplished in every 
complete cycle of communication and calculation, and is 
executed in a carefully synchronized fashion so chama 
writing of the data structure by a producer «is comp 
before that data structure is read by the consumer [REKA79]. 
In our heat flow problem this sequence of events occurs in 
the following way: suppose we map an imaginary grid over the 
plate denoting at each line intersection a Transputer which 
is in charge of calculating a square segment of temperatures 


for the pilare. 
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AS Soon as some process is ready with the updating of 
its set of temperatures due to a previous boundary exchange 
with its neighbors, it proceeds to calculate the new 
temperatures, updating its internal array of temperatures 
Mpadating its local memory, represented by the data 
structure which contains the array of temperatures). It is 
then ready fora new cycle, which always starts with the 
boundaries exchange (write in and read from the local memory 
of its neighbors). This last action cannot be seen as a 
local activity which only affects the state of the neighbors 
S Chis process but as a kind of chain reaction which is 
propagated in vertical and horizontal sense all over the 
network, creating the so called virtual shared memory 
effect. We can observe that assertion in Figure 5.6 on next 


page. 
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| For simplicity only a row of the network is displayed, but this transfer 
| of boundaries, which in turn represent the update mechanism, must be 
observed as a simultaneoua process in both directions, (up--down and 


right-- left). 


Figure 5.6 Memory Updating Mechanism in the Network 


Let's describe what we mean with chain reaction "a 
more precise way: suppose at some instant of time the 
process O receives and sends (writes in its local memory smu 
writes out the surrounding local memories) the boundar 
from/to its neighbors. The following processor (or mannii TEE 
neighbor on bottom or right) let's call and locate i af 
right, process 01 which does exactly the same to jis aie 


2 


and the next process receive and send these boundaries 
behaving in the same way until we arrive at the end of the 
row, which we call the end for the sake of the illustration 
of the concept. In reality if we look in more detail, we 
shall agree that this end of the row does not exist, because 
the last Transputer is physically connected to the first 
Transputer in a closed loop. Moreover, this movement of data 
to the right is also registered in the opposite sense 
concurrently (from these notions were established the name 
of "double transitive closure"). Thus we can assert that at 
any instant of time each Transputer in the network updates 
or writes into the local memories of the other Transputers 
in the network due to a kind of interactive total exchange 
of boundaries. In other words, when  Transputer O0 receives 
the boundaries from Transputer 01 at its right, it is 
receiving not only the effect of the boundary temperatures 
of this  Transputer but also the effect of boundary 
temperatures in Transputer 02, and Transputer 03, and so 
gorth in a concurrent fashion, yielding a kind of 


instantaneous daisy chain transmission. 
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We can assert that the time in which memory was last 
updated in process 0 due to the data produced in process 3 
at the end of the row is the very same as the time for 
updating of memory in process 3 due to the data produced» 
process 0. In the timing diagram of Figure 5.7 is show in 
concurrent activity Map Hie sixteen  Transputer network 
prototype, using for the sake of simplicity only a Fones 
the array. It must be remembered that the activity occurs 
concurrently in a vertical and horizontal sense, in maai 
left and top to bottom directions, and vice versar 

The symbol C stands for calculations and the symbol D 
for updated data value. During the first complete aai 
process O updates its data value, receiving information via 
link2 from process 01 and process 01 at the same time 
receives this information for its own consumption from 
Transputer 0 via link3. This activity is performed 
concurrently. At the same time, process 01 does the same for 
process 02, and process 02 for process 03. After that, we 
observe a parallel calculation activity in the fourth 
process, which will last, at a maximum, the time which 
takes the last process to achieve its calculations. This 
does not means that the next iteration will be delayed by 
any processor calculation other than the process 0 
calculation, which 15 ing b r K start the cyelen 
Therefore the calculation activity of the slower process may 


overlap in time with the updating data time of the other 


74 


processes for the next updating cycle, and the other way 
od mi: is the mpdating activity of a process may 
overlap with the calculating activity of the other process, 
EU the bottom line for this overlapping is that it is not 
possible to perform an updating activity which belongs to a 
determined cycle with the calculations data of the same 


cycle. 





D D1 1 D2 
Proc. 2 E oe E EE 


| 
D 
x Proc. 3 0 CO D1 C1 D2 C2 


TIME ——— N 


The Parallelism is easily observed by the overlapping in time 


of the update and computational periods of the different 
processes. 





Fielies Timing Diagram 
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I. THE TIMING CONSTRUCT 

To finish, we wish to describe the timing construct used 
to obtain the measurements. 

The T414 Transputer has two timers; a high prior 
timer with a resolution of one microsecond and a cycle time 
of about 71 minutes, and a low priority timer which ma 
resolution of 64x 10^ -6 seconds and a cycle time of 37 
hours. The timer used was the low priority timer, and the 
type of construct was an elapsed time construct to determine 
the elapsed time from start to finish of some activities 
within the process. The basic structure of this conse 


can be seen in Figure 5.8. 





Declaration of Variables 
Timer clock: 
INT timel,time2,timetest: 
SEQ 
clock 2 timel 
x timing code 
| clock time? 
Benn tan more code | 
timetest :- time2 - timel (final result) 








Figure 5.8 The Timing Consta 


Essentially the timing construct has two variables 
integer type, (time2, timel) which are used to store the 
value of the Timer and a third integer variable called 
timetest which give us the difference , which is the value 


of interest. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 


ES CONCLUSIONS 

The conclusions that we obtain from our observations 
during this research were as follow: 

First, the effects of parallelism in the networks were 
proved practically and theoterically. 

Second, the existence of an optimal zone related to the 
Franularity Of the system and time constrains of the problem 
was predicted in theory and deduced from the experimental 
results. 

Third, the degree of parallelism attained in these 
networks is quite remarkable, as shown in the Figures due to 
speedup and efficiency. For example, in the 16 Transputer 
network prototype Vs Ol TS Or NS by o array” of 
temperatures a throughput of 671824 results per second. 
Considering the fact that we perform 7 arithmetic operations 
per result, (5 additions, one division and one 
multiplication), that gives us 4,702,768 integer operations 
per second. Also should be taken under consideration that 
because the fact of the implementation "march forward in 
time", was necessary to Copy the entire ar r ay (Gam 
temperatures into a temporary array which is later 
transferred to the real array of temperatures, thus that 
represents an overhead which slows down the process 
Significantly. 
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Fourth, the improvement in performance is a trade off 
between the number of processors (Transputer) added to the 
network and the granularity on one hand, and on the other 
hand, the cost of the hardware and the time constrains of 
the problem. 

Ep The Transputer network is an architecture 
comprised of many small internal fast memory processing 
elements that communicate to each other through the powerful 
Transputer links. Thus this architecture lends itself to 


fine grained problems. 


B. POSSIBILITIES OF THE TRANSPUTER 

At the beginning of this thesis some guidelines about 
the importance of the Transputer were given. 

The real importance of the Transputer lies in the fact 
that it represents a new level of abstraccion ayw 
physical design of information systems. As we know so far, 
there have been two levels of abstraction: 

1) the electronic component, in which the information is 
represented in terms of electrical signals, like voltage or 
capacitance, and 

2) the logical gate, in which the information 
represented by logical levels, so the electrical details 
have been abstracted from the design process. 

The  Transputer offers a third level of abstracts 
based on language, where the basis unit is the word, which 
can be given specific semantic connotations by the provision 
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Eier appropriate set of information operations. Therefore 
the Transputer chip will be used as time goes on in much the 
same way as the discrete transistor was used about 20 years 


ago. 


C. RECOMMENDATIONS 

Bear in mind that the fundamental research reason of the 
AEGIS modeling group at the NPS, is to develop a suitable 
replacement the older architectures on board the Ticonderoga 
class ships. It is recommended that rather than broadening 
the Transputer Laboratory to cope with this function, the 
research should be divided into specific smaller projects 
which help to implement the new system. This recommendation 
is basically due to the limited availability of resource for 
a small group like this. 

Another recommendation is to seek for feasible research 
projects related to weapons that can be developed by the 
Group. 

ai al O portante to continue the trend of this 
emesis in following the exploration of this type of 
architecture and the production of software for it. 

Et will be interesting to see how this type of 
architecture can handle problems as weather forecasting for 
a particular weather model. Finally is important to continue 
e earch in the field of graphic applications, especially 
that which pertains to the study of Chaotic Systems such as 
Mandelbrot and Julia sets. 
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APPENDIX A 


01 TRANSPUTER NETWORK SOURCE CODE 


— < Q m. m emm emm emm f IF emm om emm ms emm  “ F V T? emm emm — emm o — OS c] eo emm n, ms m ms m m, m, om U s s s 


"USE “eo: Vtdsiolib serie af 
VAL linkOout IS 
VAL Linklour His 
VAL Link Zen TE 
VAL ink gow. as 
VAL Tink ra IS 
VAL linklin IS 
VAL link2in IS 
VAL link3in DSTI: 

CHAN OF ANY leftin,rightout,antirightout,antileftin: 
PLAGE leftin AT ink in: 

PLACE ridhteut AT linkse 

PLACE anti. gncCouc ATTIN out 

PLACE antileftin AT link2in: 

LOOB sum me 

VAL s IS 11: 

VAN ese fo 223: 

VAD g TS 333; 

VAL size IS 24: 

INT w tag he NOZ, EZE: 

[size] INT temp: 

[size] INT recp: 

[Size] INT recor: 

[Size] INT recp2: 

[size] [size] INT rulsvo: 


-J] Ow Ci ;» C0 L = O 


SEO 
no:=0 
write.full.string (screen, " Enter the hot end 


temperature ") 
read.echo.int (keyboard,screen,he,no) 
newline(screen) 
JO: =Ü 
write.full.string (screen, " Enter the propagation 
rate ") 
read.echo.int (keyboard,screen,w,no) 
newline(screen) 


SEQ 
SEO r = 0 FOR SiZe 
SEQ 
temp [r] := 0 
recp [ARE 6 
tecpl [] s0 
recp2 [Y] := 0 
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SEO r - O FOR size 
temp [r] := he 
tag:= 9 
ambr ONI! tag vw. temp 
uen Gu k'ir t ag temp 
AN UF GAME GEED? 
pene ole tro p! 
turning := TRUE 
SEO 
WHILE turning 
PRI ALT 
keyboard ? z 
SEQ 
TE 
Z = esc 
SEQ 
tag:s S 
antileftin ? recp 
deitini seep 
antirightout ! tag;w;temp 
rightout ! tag;w;temp 
ancsilecrt mn Cru lle 
SEDE == 0 FOR Size 
SEQ 
SEO c = 0 FOR size 
SEQ 
CO ME] fe] 
write.int (screen,txt,4) 
newline(screen) 
turning := FALSE 
newline(screen) 
TRUE 
STE 
antileftin ? recpl 
SEQ 
leftin ? recp2 
am Peron cone tag, w, temp 
KIGACOUE tag. w: Lemp 
antirightout ! recp2 
rightout ! recpl 
newline(screen) 
write.full.string(screen, "Type ANY to return to TDS") 
INT any: 
read.char(keyboard, any) 


om 


VAL EE EE 
VAL Linklout Sis 
VAL link2out IS 
VAL link3out IS 
VAL linkOin TS 
VAL linklin IS 
VAL Wink 2 IS 
VAI Tinko or TS 
[9] CHAN OF ANY channel,antichannel: 
PROC central.node(VAL INT engine, CHAN OF ANY 
leftin,topin,richtın bortor vs 
leftout ,topout,rightout, Dot sons ns 
“USE "er\tdsliolibsuser loar ur 
BOOL active : --Declarations 
VAL s TS Ll: 
VAL. d TS 135. 
VAL size IS 24: 
INT tad tp m: 
[size] [size] INT square: 
[size] [Se INI Ca LeUk 
[size] INT dummyo: 
[size] INT dummyl: 
[size] INT sendero: 
[size] INT senderl: 
[size] INT sender2: 
[size] INT sender3: 
WHILE TRUE 
SEO -- Array initialization 
SEQ r- O FOR size 
SEQ C- 0 FOR SIZE 
SEQ 
square r|m C] 
calcul r hen 
SEQ r= 0 FOR Size 
SBO 
dummyO [r] := 0 
dummyl [r] := 
sender0 fr] 
senderl [r] 
sender2 [r] 
sender3 [r] 
active:= TRUE 
n:= engine 


WOU + uN O 


° 


H H 
O 


O 000 


8 2 


WHILE active 
SEO 
TP: 
ns 0 
SEO 
or tog summit 
EITE tac 


IF 
tag= s 
SEQ 
active:= FALSE 
topout ! square 
TRUE -- Communication block 
SEQ 
PAR 
leftin ? dummy0 
op Gg dummy 0 


panne” dummy 0 
bottomin ? dummy0 


leftout ! senderO 
topout ! senderl 
rightout !  sender2 


bottomout ! sender3 
SEO r = 0 FOR size 
SHO 
sguare[r] [0] :- dummyl [r] 
SEQ r = 1 FOR size - 2 
SEGE - FOR Size - 2 
SEQ 
tp:= ((w * square [r] [C] ) + 
ae ka] [E=I]) square 
[r] [C * 1] square[r-1][c] 
Um dare Ea 1] [cC] 
)))) Z (4 * w) 
call [rx] [fé] := tp 
SEO r = 0 FOR Size 
calcul [r] [0]:= square[r] [0] 


square := calcul 
SEO r = 0 FOR size 
SEQ 


senderO [r]: 
senderl [r]: 
sender2 [r]: 


square[r] [1] 
square[1] [r] 
square[r] [size - 


sender3 [r]:=square[size - 2] [r] 
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PROC transp.horizontal (VAL INT engine, CHAN OF ANY 
leftin,topin,rightin ,bottormlin, les 
topout,rightout, bood i 


HUSE "c:Ntdslolipxuser ES IN 
BOOL active: 

VALTS TS ZI 

VAL g Te 333: 

VAL size IS 24: 

INT Lag wn: 

[size] INT speel 

[size] INT spec2: 


WHICE TRUE 
SEO 
SEQ r = O FOR size 
SEO 
Spec! pr] 2o 
spec2 [r] :- O 
n:= engine 
active:= TRUE 
tag:= g 
WHILE active 
SEO 
Te 
n= 2 
SEQ 
Leftin -Stag 
TE 
tag= s 
active:= FALSE 
TRUE 
SEO 
erin speel 
rightin? spec2 
leftout ! spec2 
rightout ! specl 
n= 3 
SEO 
leftin ? tag 
IF 
tag = s 
active:= FALSE 
TRUE 
SKTE 
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PROC transp.vertical (VAL INT engine, CHAN OF ANY 
ET RA EED IN, right in, Dottemin, 
NE re cDouL,rlgnromnct 
bottomout ) 


CURE ve scasdolibljuserio.tsr'": 
BOOL active: -- Variable declaration 
VAL sS IS 11: 
KAL NSS > 
VAL size IS 24: 
a TC] Waal 
Mai ze] INT speci: 
Se TIN eS 
WHILE TEUR 


SEO 
SEQ r = 0 FOR size 
SEQ 
Speci jr] := 0 
Speez r] := 0 


n:= engine 
active:= TRUE 
tag:= g 
WHILE active 
SEQ 
TE 
n= 1 
SEQ 
bottomin ? tag;w;specl 
grantout ! tag 
TE 
tag- s 
active:- FALSE 
TRUE 
SEQ 
bottomin ? specl 
topout ! specl 
topin ? spec2 
bottomout ! spec2 


YE 


PLACED PAR 
PROCESSOR 0 T4 
PLACE channel [0] 
PLACE channel [1] 
PLACE channel[2] AT link2in: 
PLACE channel[3] AT link3in: 
PLACE antichannel[0] AT link0out: 
PLACE antichannel[l] aT nene: 
PLACE antichannel[2] AT link2out: 
PLACE antichannel[3] Ar 115 oun 
central.node(0,channel[0],channel[1],channel[2],channe NP WE 
antichannel[0],antichannel[1], 
antichannel[2],antichannel[3]) 


AT linken: 
ATT] ine 


PROCESSOR 1 T4 
PLACE channel[4] 
PLACE channel[5] 
PLACE channel[3] AT link2out: 
PLACE channel[6] AT link3in: 


AT m 
AT e 


PLACE 
PLACE 
PLACE 
PLACE 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


antichannel[4] AT linkOout: 
antichanñnel l 5) AP lin ion 
antichannel[3] AT pin ool: 
antichannel (6 | sr lin oon l 


transp.vertical(1,channel[5],antichannel[3],channel[6], 


channel[4],antichannel[5],channelf3], 
antichannel[6],antichannel[4]) 


PROCESSOR 2 FA 


channel[ I ATO Tn O Dn 
channel[0] AT linklout: 
channel[8] AT link2in: 
channel[2] AT link3out: 
antichannel[7] AT link0out: 
antichannel[0] AT linklin: 
antichannel ps PAT nen: 
antichannelfa] sin om 


transp.horizontal(2,antuchannelp2].channe MD 


antichannel[0],channel[8], 
channel[2],antichanne n N 
channel[0],antichannel[8]) 
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PROCESSOR 3 T4 
PLACE channel[5] AT linkOout: 
PLACE channel[7] AT linklout: 
PLACE channel[6] AT link2out: 
PLACE channel[8] AT link3out: 
PLACE antichannel[5] AT linkOin: 
PLACE antichannel[7] AT linklin: 
PLA Ant icnsnnellié) AT link2in: 
PLACE antichannel[8] AT link3in: 
r ns er ERAS antichannel[6],antichannel[8], 
ancilecnannel psi antichannel[7],channel(/6], 
channel[8],channel[5],channel[7]) 
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APPENDIX B 


04 TRANSPUTER NETWORK SOURCE CODE 


PROC input.handler (CHAN OF ANY keyboard,screen) 


This procedure handles the input and output from the 
Transputer 
network. 
#USE m"c:Ntdsiolipsuser:iesrsr 
VAL linkOout IS 
VAL Linklout TIS 
VAL |1ink2eët STO 
VAL link3out IS 
VAL linkOin TS 
VAL linklin IS 
VAL link2in IS 
VAL link3in TS 
CHAN OF ANY leftin, rightout,antirightout, antilerersss 
PLACE leftin AT link3in: 
PLACE ridhtout AT LinkOut: 
PLACE antiridhtout AT JURE SUE 
PLACE antilettin AT 1ink2in: 
EOOE do turning: 
VAL s IS 11: 
VAL "esc TS 223: 
VAL g IS 333: 
VAL size IS 12: 
INF w,tag nes uo z counters 
[size] INT temp: 
[Size] INT recp: 
[Size] INT recpl: 
Size" TNT 'recCcp2:; 
[S12ë] [size] INT trulv0: 
[Size] [Size] INP il e 
[size] [size] INT truly2: 
[size] [size] INT truly: 
size) [size] INTEX: 


--- Variable 
--- Declarations 


JW EMM O 


SEO 
no:=0 
write.full.string (screen, " Enter the hot end 


temperature") 
read.echo.int (keyboard,screen,he,no) 
newline(screen) 

=0 
write. full.string (screen, " Enter the propagatiam 
rate ") 

read.echo.int (keyboard,screen,w,no) 
newline(screen) 
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SEO 
XEM 
SEQ 

temp [r] 
recp [r] 
He Col E 
recp2 [r] 
SEQ r = 0 FOR 
temp [r] := 
tag:- g 
anesxrcghtowt |! 
r LO Ile m! 
aridiridgheeut ! 
Fignzeeurser recpil 
turning := TRUE 
SEO 
WHILE turning 
PRI ALT 


O FOR si 


si 


keyboard ? 


SEQ 
EE 
Zz = 


he 


ze AE FEN SEN 


0 
O 


0 
0 
ze 


tag;w;temp 
tag;w;temp 
i E I 


re 


a 


esc 


SEQ 


Printing the temp. 
array 


SEQ 
tag:= s 
aneslertjin,? recp 
leftin ? recp 
ans ae out l tag; w, temp 
BIOHESUT ! Eag:w: temp 
counter := 0 
umo muc bruly2: 
ind DOES 
WHILE counter « 4 
SEO 
SEQ r = 
SEQ 
SEO C = 
SEQ 
tx:= trulyo 
Ex Se: 
write.int (screen, 
Lab) 


0 FOR size - 1 


0 FOR size - 1 


SEQ 1 = 1 FOR size - 1 
SEQ 
pr iud» 


e enkt L] 
write.int (screen, 
BES 
newline(screen) 
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SEQ r = 1 FOR size s 


SEQ 
SEQ d = O FOR size “mu 
SEO 
tx:= trui 
txt:= t>” [r JP m 
write.int (screen, 
TEER 
SEQ h = 1 FOR size 
SEQ 


LR EN AE GAN 
txt:= Ex cas 
write.int (screen, 
CXC, p 
newline(screen) 
counter:= counter + 4 
turning := FALSE 
newline (screen) 
ante tctrm 2o%recpi 
SEQ 
leftin 2 recpo 
antirightout ! tag;w;temp 
Fidhrout tag woe 
ante irigheone eee 
rightout EES 
newline(screen) 
write.full.string(screen, "Type ANY to return tolas 
INT any: 
read.char (keyboard, any) 
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VAc IIinkoout IS O 
WAL linklouc IS 1 
RAR link2zoUt . IS 2: --- Chanrel declaration 
Han link3cut IS 3: 

DAL linkoin TSA 

WAL danklain ES 5 

VAL link2in TS: 6; 

MAD linkin ES! 7”: 

[9] CHAN OF ANY channel ,antichannel: 


— r mI emm emm emm om emm hr m emm emm am I ama emm am am ann avè mm mm mm mm mm mm m6 a omg emm emm emm emm  " mn ES emm emm ES EN j w a US a w s an a 


PROC central.node(VAL INT engine, CHAN OF ANY 
Werftin, topin, 


it in, bottomin,lseftout, topout,rightout, 
bottomout ) 


tusis "c: Vtdsiolibluserio.tsr": 
BOOL active 
VAL s IS 11: 
VAL g IS 3338 
VAL size IS 12: 
INT taq,w, tp, ae 
[Size] [size] INT square: 
[size] [size] INT calcul: 
[size] INT dummyo: 
[size] INT dummyl: 
[size] INT dummy2: 
[size] INT dummy3: 
[size] INT dummy4: 
[size] INT sendero: 
[size] INT senderl: 
[size] INT sender2: 
[size] INT sender3: 
[size] [size] INT temporal: 
WHILE TRUE 
SEO 
SEQ r= 0 FOR size 
SEQ c2 0 FOR size 
SEQ 
square [rJ ke] 
sal ae] [e]: 
temporal [r] [c] 
SEO Ee O FOR size 
SEO 5 EEO 
dummy0 [ -- Initialization 
dummyl [ 
[ 
[ 


l Il 
loo 


dummy 2 
dummy 3 


EIS OO 


Sn 


dummy4 [r] :2 O 
senderO [r] : 
senderl [r] 
sender2 [r] 
sender3 [r] 

active:= TRUE 

n:= engine 

WHILE active 

SEQ 
TE: 
n= 0 
SEQ 


H HUH 
eoo c 


topin ? tag;w;dummyl 


puc cou we 
IP 
tags S 
active — 
TRUE 
SEQ 
PAR 
left ras 
CODINA? 
EC LD a 
bottomin ? 
Ere! 
EOpeuUr 
Figntoucg:! 
bottomout t 
SEO = 0 FOR 
SEQ 
sqguarefri 


tag;w 


FALSE 


dummyO --Communication 
dummy4 -- Block 
dummy 2 

dummy 3 

sender0 

senderl 

sender?2 

sender3 

size 


[0] :- dumay! (eg 


square[r] [size - 1] :- dumy2 
[Y] 
square[size - 1] [r] ze dummy3 
[r] 
SEQ r = 1 FOR ar 2e zz 
-- Calculations 
SEQ c = 1 FOR Size - 2 
SEQ 
tp:= ((w * square [r] [cC] ) + ( 
square[r][c-1] + saam 
[r] [C + 1] + ( Square wan 
[c] + square [E + 1 ie 
)))) / (4 * w) 
calcul fr] [el eco 
SEQ r = 0 EGF ce 
calcul [r] [0]:= square |r llu g 
square := calcul 
SEQ r = 0 FOR size 
SEQ 


sender0 [r]: 
senderl [r]: 


- 


SQuUare r 
square[l] 


[1] 
[r] 


sender2 [r]: 
sender3 Tri: 


square[r][size-2] 
square[size-2][r] 


n= 1 
SEQ 
bottomin ? tag;w;dummy 3 
rienteuter! cad w 


Jem 
tag= s 
active:= FALSE 
TRUE 
SEO 
PAR 
Teftin ? dummy 0 
TOPIN ? dummy 1 


matin ? dummy 2 
bottomin ? dummy4 


leftout ! sendero 
torde sender1 
i Four ! — sender2 


|pestomout- ' sender 
SEQ r = O FOR size 
SEO 
square[r] [0] dummy3 [r] 
square[0] [r] dummyl [r] 
sguare[r] [size-l] :- dummy2 [r] 
ster =P FGORWSIZE — 2 
SEQ c = 1 FOR size - 2 


SEQ 
tp:= ((w * sguare[r][c]) 
area] [Ic ll] + 
(Saque e es A + 
sua repe=lL | [e] + 
square [r * 1] [c] )))) / 
(4+w) 
elem ed ie] =p 
SEO p=) 0 FOR size 
calcul [r] [0] := square [r] [0] 
square := calcul 
SEQ r = O FOR size 
SEO 


SEASON | 
sender1 [r] 
sender2 [r] 


Square [r] [1] 

G qu [KE 

square[r][size- 
2] 

sender3 [r] := square [size - 2] 


[r] 
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n= 2 


SEQ 
leftinzzrac,w 
TE 
tag= s 
active:= FALSE 
TRUE 
SEQ 
PAR 
leftin eg dummy0 
HOE EG dummy 1 


right im. dummy?2 
bottomin ?  dummy3 


leftout ! sendero 
topont | senderl 
ETOMEOUE ender 


bottomout ! sender3 
SEQ r O FOR size 


SEQ 
sguare[r] [0] := dummy0 EM 
square[size -lj[r] := dummy3 [r] 


SEQ r = 1 FOR size “w 
SEQ c = 1 FOR Size - 2 
SEQ 
tp:- ((w * square[rj[c]) an 
square ([r] [C-l]+( sean 
[r] [ce + 1] + (sq mw 
[r=-1][C] r sqüarce MEME 
[C] )))) / (4 + w) 
calcul [2] [el 5 


Square := calcul 
SEQ =r = 0 ron size 
SEQ 


sendero [r] 
senderl [r] 
sender Mr 
sender3 (r] 


square [rd EM 
square [1] NEM 
sguare[r]([(sSize =, 
square [size] a 


[r] 
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jo 
SEO 


leftin ? tag;w 


TE 


TRUE 


tag- s 
active:= FALSE 
SEQ 
PAR 
leftin ? dummy0 
LOPIE? dummyl 


rure e dummy 2 
bottomin ? dummy3 


Jeftout..! sender0 
tOpPouUE senderl 
rightout ! sender2 


bottomout ! sender3 
SEQ r = 0 FOR size 
SEO 
Square[0] rr] 
square[r] [0] 
SEQ r = 1 FOR size - 
SEO c = 1 FOR size - 2 
SEO 
tee sguare[r] [cC] ) + ( 
Su edit iet] T square 
BEIC CIPIT Squares 77 
Ru quarc [r EE AI el 
))))/(4 + w) 
calcule] <= tp 


dummyl [r] 
dummyO [r] 
2 


square := calcul 
SEQ r2 0 FOR size 
SEQ 
sendero [r] := square [r] [1] 
senderl [r] := square [1] [r] 
sender2 [r] := square[r][size-2] 
sender3 [r] := square [size - 2] 


[r] 
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TE 
n=0 
SEQ 
bottomout ! square 
rightin 2 temporal 
bottomout ! temporal 
n=2 
SEO 
leftout ! square 
n=3 
SEQ 
rightout ! “square 
n=1 
SEG 
topin ? temporal 
bottomout ! temporal 
topin ? temporal 
bottomout ! temporal 
bottomout ! square 
leftin ? temporal 
bottomout ! temporal 


PLACED PAR 
PROEES OR 6 Ta 

PLACE channel[0] AT linkOin: 

PLACE channel[1] AT linklin: 

PLACE channel[2] AT link2in: 

PLACE channel[3] AT link3in: 

PLACE antichannel[0] AT linkOout: 

PLACE antichannel[l] AT linklout: 

PLACE antichannel[2] AT J1nKZeu 

PLACE antichannel[3] AT link3out: 

central.node(O,channel[0],channel[l],channel[2], 
channel[3],antichannel[0],antichannel[1], 
antichannel[2],antichannel[3]) 


96 


PROCESSOR 1 T4 
PLACE channel[4] 
PLACE channel[5] 
PLACE channel[3] 
PLACE channel[6] 


Fr |inkOIN: 
AT Mn k in: 
ARE Z OU E : 
AT Vink Sin: 


PLACE 
PLACE 
PLACE 
PLACE 


antichannel[4] AT linkOout: 
antichannel[5] AT linklout: 
anicdchannelps) AT link2in: 

antichannel[6] AT link3out: 


central.node(1,channel[5],antichannel[3], 
channel[6],channel[4],antichannel[5], 
channel[3],antichannel[6],antichannel[4]) 


PROCESSOR 2 T4 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


AT linkO0in: 

Pr EE rie NG GE: 
channel[8] AT link2in: 
channel[2] AT link3out: 
antichannel[7] AT link0out: 
STEET Er EE 
antichannel[8] AT link2out: 
side tr 


channe1[7] 
channel[0] 


central.node(2,antichannel[2],channel[7], 
antichannel[0],channel[8],channel[2], 
antichannel[7],channel[0],antichannel[8]) 


PROCESSOR 3 T4 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


central.node(3,antichannel[6],antichannel[8], 
antichannel[5],antichannel[7],channel[6], 
Channel [sq channel [5 channel (77) 


AT linkoovr: 
ATA LAK Tout: 
channel[6] AT link2out: 
channel[8] AT link3out: 
antichannel[5] AT linkOin: 
antichannel[7] AT linklin: 
ant channet[6] AT Link2in: 
antichannel[8] AT link3in: 


channel[5] 
channel[7] 
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APPENDIX G 


09 TRANSPUTER NETWORK SOURCE CODE 


QUSE "c:Atdsiol'ib mse ue: 
VAL linkOOouto us 0r 

VAL |inkieutr! as 
VAL link out Ma 
VAL Link Out n 
VAC n ae dem LS 
VAL link im ÉS 
VAL link2in IS 
VAL -linkin no 
CHAN OF ANY EE ie rightout,antirightout, antite fis 
PLACE leftin AT link3in: 

PLACE rightout AT Link Jowt: 

PLACE antirightout AT link2out: 

PLACE antileftin AT link2in: 

BOSE IO WEILGI. 

VAL Susa 

VAL esc lo 220: 

VAL es O: 

VAL size IS 8: 

INT W,tag,he nojz,counber counted Hi 

[succ DNI bene. 

[Size] TNT reco: 

[size] MT EED: 

[Size] INT recp2: 

[Size] [size] INT truly: 

[9][size] [size] INT true: 


OU ION 


SEG 
NO d 
write.full.string (screen, " Enter the hole 


temperature ") 
read.echo.int (keyboard,screen,he,no) 
newline(screen) 
no:=0 
write. full.string (screen, " Enter the propagan 
rate "| 
read.echo.int (keyboard, screen,w,no) 
newline(screen) 


ING, -- Array initialization 
SEO r = 0 FOREST = 
SEQ 
temp [r] := 0 
recpÀ [r] : 8 
retp EEN 
FScCB2 mu 


SEO r = 0 HORSE 


OG 


temp [r] 
tag:= g 
ancirightourt 
Ecoute L 
amerr icheoue 
Lc COME r 
tT DC = T 
SEO 

WHILE turn 

PRI ALT 
keyboa 
SEO 

TẸ 


:- he 


! tag;w;temp 
ag: w; Lemp 
DEO 2 
EXE 
RUE 


ing 


Ede? - 


Z - esc 
SEQ 
SEO 
tag:- S 
antileftin ? recp 
Samin? Eecp 
curie siehts c cad:w: cemp 
nou ESE W  'Eenp 
0 


counter := 

counterl := 0 

WHILE counter < 9 
SEQ 


apellefein 2 truly 
SEQ h = 0 FOR size 
SEQ p = 0 FOR size 


true [counter] [h] [p] := 
Emu aq 
counter :- counter + 1 
SEQ 
SEQ r = O FOR size - 1 
SEQ 
SEQ c = O FOR size - 1 


SEQ 
Ek. a tEuelcsounteri] (Fr) 
[c] 
write.int (screen,txt,3) 
SEO 1 = 1 FOR size - 2 
SEO 
txt:- true [counterl + 
EN 
write.int (screen,txt,3) 
SEQ d = 1 FOR size - 1 
SEO 
txt:- true [counterl + 
eie Ed 
Weite. ınces (screen, txt,3) 
newline(screen) 
counterl:= counterl + 1 
SEO r = 1 FOR size - 2 
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SEO 
SEO c = O FORMS TZ K IM 
SEçEQ 
txt:= true [counterl] 
[r] [SW 
write.int (screen, tx eas 
SEQ I = 1 FORENT 
SEQ 
txt:= true [counter 
3] ET 
write.int (screen L ` Dp 
SEQ d = I FOR size MM 
SEQ 
txt:= true [counter E: 
6) [EIEN 
write. int (screen, ExEDaN 
newline(screen) 
counterl:= counterl + 1 
SEQ r = 1 FOR size Ml 
SEQ 
SEQ C = 0 FOR si EE MI 
SEO 
txt:= true [counter WM 
[r] Fa 
write.int (screen,txt,3) 
SEQ 1 = 1 FOR size o 
SEQ 
txt:= true [counterl + 
3] (EMM 
write.int (screen,txt,3) 
SEQ d = 1 FOR size - 1 
SEO 
txt:- true [counter mE 
6] [r] tem 
write.int (screen,txt,3) 
newline(screen) 
counterl:= counterl + 1 
curning: S EAL E 
newline(screen) 
antileftin ? recpl 
SEQ 
leftin 2 mee 
antirightout ' tag :witemp 
rightout ! tag;w temp 
antirigheout ilor 
rignheoue | recpi 
newline(screen) 
write. full.string(screen, "Type ANY to return to Ta EE 
PIL sanya 
read.char(kevboard, any) 
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VAL 


[35] CHAN OF ANY 


linkOout 
rm Jou t 
res OE 
nas ou E 
IT 
VET IN 
re? n 
da ms in 


H 
u 
OU PUNDHO 


-- Channel declaration 


channel,antichannel: 


central.node(VAL 
kettcin, topin, 


PROC 


INT 


engine, CHAN OF 


mcn bottoemin,1eitout, topout,rightout, 
DOCEOMOUE) 


#USE 


NENAS IS El: 
NALNG TS 333: 
VAL size IS 8: 
IND tag, w, COTTI 


ze) 
[size] 
[size] 
[Size] 
[Size] 
Ne 
[Size] 
RENE 
[Size] 
[Size] 
Eu e 
[size] 


[Size] INT square: 
[Size PAINT calcul: 
[size] INT temporal: 


INT 
INT 
INT 
INT 
TNF 
INT 
INT 
TNE 
INT 


WHILE TRUE 


SEQ 


SEO: 


dummyo : 
dummy l: 
dummy 2: 
dummy3: 
dummy 4 : 


sendero: 
sender]: 
sender2: 
sender3: 


O FOR size 


SEO 
square [r] [cC] 


EE 


SEQ 


salsa E) ' 
cemporal [r] 


O FOR size 


dummy0 
dummy 1 
dummy 2 
dummy 3 
dummy 4 


SEO ‘c= 0 FOR size 


c] 
[ 


ooooeo 


c] 
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¡OO 


“Ee: ds ed HE wser io Csr": 


BOOL active --Variable and array 


declaration 


senderO [r] 
senderl [r] 
sender2 [r] 
sender3 [r] 
active:- TRUE 
n:- engine 
WHILE active 
SEQ 
TR 
n= 5 
SEO 
leftin ? Ege N 
rigntour caes 


Il Hu HI ll 
OOOO 


TE: 
tag- s 
SEO 
active:- FALSE 
topout ! square 
bottomin ? temporal 
topout ! temporal 
TRUE 
SEQ 
PAR 
leftin ? dummy O 
topin ? dummy 1 


Fighting: dummy 2 
bottomin ? dummy3 


leftout ! senderO 
topsute t senderl 
rightout ! Sender? 


bottomout ! sender3 
SEO r - OCHOBRESUZO 
SEO 
Square | O MB 
square [|e janes 


dummyl [r] 
dummyO [r] 


square[r][size -1] := dummy2 [r] 
sguare[size -l][r] :- dummy3 [r] 
SEQ r = 1 FOR sien Do 
SEQ C = L FOR size -2 
SEO 
tp:= ((w * square [r] ana 
( Square [r mr NS 
(square [r ][c E 
("square [r-1 ea 
square [r - 1] [c] ))9)9X 
(4 + W) 
calcul [t] [ea] See 
square Sjelen 
SEQ r = G' EOR saye 
SEO 


sender0 [r]: 
senderl [r]: 


Square [TIN 
square | Der 


oO 


sender2 [r]: 
sender3 [r]: 


square[r][size- 2] 
square[size-2][r] 


— E t 2 E E E NS emm E E s S s s sQ sQ Q s Q s S S Q s Q Q Q s sQ s Q Q Q Q Q Q S Q S Q S S s Q E NS SE Q — Y NES a= 


PROC corner.node(VAL INT endine, CHAN OF ANY leftin,topin, 
Ee HEG sem IE FEOUL , EOpout ,EiAhtout, 
Dottomout) 
see: vide io lib userio.tsrYs 
BOOL active 
VAL S IS 11: 
MARY IS 333: 
VAL size IS 8: 
NNI EQ w, Cp, n, coúntero: 
[size] [size] INT square: 
#s1ze] [Size] INT salen: 
[size] [size] INT temporal: 
[size] INT dummyO: 
[Size] INT dummyl: 
ize] INT dummy: 
[Size] INT dummy3: 
[size] INT dummy4: 
[size] INT sendero: 
[size] INT senderl: 
[Size] INT sender2: 
[size] INT sender3: 
WHILE TRUE 
SEQ 
SEQ r= 0 FOR size 
SEQ cz O FOR size 
SEQ 
square [r] [cC] 
calcule [ec] 
nee E] 
SEQ r= O FOR size 
SEQ 
dummyO [r] 
dummyl [r] 
dummy2 [r] 
dummy3 [r] 
dummy4 [r] : 
senderO [r] 
Sender [E] 
sender2 [r] 
Sender3 [r] 
active:= TRUE 
n:= engine 


I Il 
oo 


iow Hl H l 
OOOOO 


O O O O 


O3 


WHILE active 
SEO 
LIG 
ns 0 
SEO 
topin ? tag:y dun] 
tigntouk die 
bottomouc ! tag;w;dummyl 
TE 
tags s 
SEO 
counter0:= O 
active:= FALSE 
COPS UE aaa 
WHILE Counter r m 
SEQ 
bottomin ? temporal 
topout ! temporal 
counter0 :- counter0 + 1 
WHILE counterO < 8 
SEQ 
rightin tempo 
topout ! temper 
counter0 :- counter0 t 1 
TRUE 
SEO 
PAR 
Jertin ? dummy 0 
top ND. dummy 4 
ENEE dummy?2 
bottomin ? dumny) 


lef tour sendero 
Copom! senderl 
rightout !  sender2 


bottomout ! sender3 
SEQ r = 0 HORE 
SEQ 
square[r] [0] := dumm M I 
square[r][size- 1] dummy2 [r] 
square[size - 1](x]:= dummy? ma 
SEQ r =71 FOR size 
SEQ C = 1 FOR self 
SEO 
tp:= ((w * square ([r][C ]) ra 
square [r] [c-l] + 
( square [r] [C +W 
squaré [r-1] [ei 
square [r + 1J][cCc])) R 
(4 + w) 
calcul [rJ jc jp se 
SEQ r = 0 FORENE 
calcul [r] [0]:= sgüare [r REN 
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square := calcul 
SEQ r = O FOR size 
SEQ 
sender0 [r]: 
sender1 [r]: 
sender2 [r]: 
sender3 [r]: 


seuarejırır [pe 
Sen (EI 
square[r][size- 2] 
square[size-2] [r] 


n- 2 
SEQ 
Bot comin 2? Ktag:w;:dummy3 
Y Ou GW 


TE 
tag= 
SEQ 
active:= FALSE 
ope square 
TRUE 
SEG 
PAR 
leftin ? dummy0 
topin ? dummy 1 


Renee > dummy?2 
bottomin ? dummy4 


leftout ! sender2 
TOPQut ! senderl 
rightout ! sender2 


bottomout ! senderl 
SEQ r = O FOR size 


SEQ 
square[r] [0] := dummy3 [r] 
square pr] -= dummyl [r] 
square[r] [size -1]:- dummy2 [r] 


SEQ r = 1 FOR size - 2 
SEQ C = 1 FOR Size - 2 
SEQ 
p II s sSsqüare [r][C] ) + ( 
Seus re F 
drs C I] F 
See me (ELI [Ce] fF 
sdüare lir t 1] 1c] No) Z 
(4 + w) 
Selle o e e cp 
EO -s O FOR Size 


temas pon = square fr] [0] 
square := calcul 
SEO Ele O FOR size 
SEO 
senderl [r] := square [1] [r] 
sender2 [r]:= square[r][size-2] 


205 


ns 8 
SEO 
teftin EE 
Figohtoui WE 


e 
tags S 
SEQ 
active:= FALSE 
leftout ! square 
bottomin ? temporal 
leftout ! temporal 
bottomin ? temporal 
leftout ! temporal 
TRUE 
SEQ 
PAR 
teten 2 dummy0 
COPAS dummy 1 


EJOHUPIDEE dummy 2 
bottomin ?  dummy3 


jerctout ! senderO 
topout ! sender3 
rightout !  senderO 


bottomout ! sender3 
SEQ E = O FOR size 
SEO 


square[r) [0] "e dummyo [r] 
sguare[size - 1][r]:= dummy3 [r] 
SEO r = 1 FOR size yo 
SEO C =- 1 FOR Size pw? 
SEO 
tp:= ((w * square[r](€l + ( 
square [r] [c-1] + 
( squares | ( 
square tr Su 
square [r + 1] [c] )))) / 
(4 + w) 
cadcm EN 
square := calecul 
SEQ r = 0 POR Size 
SEO 


senderO [r]: 
sender3 [r]: 


square [FIN 
square[size-2] [r] 
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ns 10 
SEO 
deken” cag ;W 
TE 
tag= s 
SEQ 
active:= FALSE 
topout ! square 
TRUE 
SEQ 
PAR 
leftin ? dummy0 
ESPINAS dummy 1 
Ede ede dummy?2 
bottomin ?  dummy3 


leftout ! sendero 
Eopouc ! senderl 
rightout ! sendero 


bottomout ! senderl 
SEO r = 0 FOR size 
SEO 
square[0] [r] 
square[r] [09] 
SEQ rT = 1 FOR Size 
SEO C =I FOR size - 2 
SEO 
tp:= ((w * square [r][c] ) + ( 
Square r [cC 1) + 
( square [r] [c + 1] + { 
Sr tb 
Sequare tr VI) [C)5)))) / 
(4 + w) 
sale = tp 
square := calcul 
SEQ r = 0 FOR size 
SEQ 
senderO [r] 
sender1 [r] 


dummyl [rj] 
dummyO [r] 
2 


square [r] [1] 
Scuüuare [1] [r] 


In] 


PROC cross.node(VAL INT engine, CHAN OF ANY leftin,topin, | 
rightin,bottomin, lefteur ope | 
Eighnbout, Doneemeue, 
FUSE. "c:cdsiol!b “sero. sr UE 
BOOL active 
VAL -S 15 Tl: 
VAT TS 933: 
VAL size IS 8: 
INT Tag w. Cp Nccc 
[Size] [size] INT square: 
[size] [size MINT calc eE 
[size] [size] INT temporal: 
[size] INT dummy0: 
[size] INT dummyl: 
[size] INT dummy2: 
[size] INT dummy3: 
[size] INT dummyd: 
[size] INT sendero: 
[size] INT senderl: 
[size] INT sender2: 
[size] INT sender3: 
WHILE TRUE 
SEQ 
SEQ r= O FOR size 
SEQ CE 0 FOR size 
SEQ 
square Else 
calcule ere 
temporal] mes 
SEQ r= O FOR SizZe 
SEO 
dummyO [r] 
dummy ie] 
dummy2 [r] 
dummy3 Tr] 
dummy4 [r] : 
senderO [r] : 
senderl [r] 
sender2 [r] 
sender3 [r] 
active: = TRUE 
n:= engine 


I O O 


PM Mo H Il 
O O OO 


H n H I 
DO O 03O 
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WHILE active 
SEO 
TF 
n= 1 
SEO 
topin ? tag;w;dummyl 
Enero A y 


TE 
tag- S 
SEQ 
active "FALSE 
p eum Square 
bottomin ? temporal 
peut! temporal 
TRUR 
SEQ 
PAR 
Satin > dummy 0 
Eopin ? dummy 4 


Et Ine dummy2 
looi ein 3 


leftout ! sender2 
Lopou e! sender1 
rightout ! sender2 


bottomout ! sender3 
SEQ r = O FOR size 


SEQ 
semarefr [0] := dummyl [r] 
square[r] [Size -1]:= dummy2 [r] 
square[size - l][r]:= dummy3 [r] 
sguare[0] [r] :- dummy4 [r] 


SEQ r = 1 FOR size - 2 
SEQ c = 1 FOR size - 2 
SEQ 
tp:= ((w * square[r] [c] ) + ( 
Sene EST] + 
( square [r] [c + 1] + ( 
Square Et te] + 
Square NE lie] JN) / 
(4 + wi 
calcu Ek] [c] = to 
SEO 0 FOR Size 
eodeni (04:2 square[r] [0] 


Ee TT 
SEQ r = O FOR size 
SEO 


senderl [r]: 
sender? [r]: 
sender3 [r]: 


sguare[l] [r] 
seer size —2 | 
square[size-2] [r] 
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= 4 

SEQ 
leftin ? tag;w 
EE Cain 


TE 
tag= s 
SEQ 
counterl := 0 
active:= FALSE 
leftout ? Square 
WHILE (counter so 
SEO 
bottomin ? Temporal 
leftout ! temporal 
counterl :- counter] + 1 
WHILE counterl < 5 
SEQ 
Y GE Ee 
leftout ! temporal 
counterl :- counter uM 
TRUE 
SEQ 
PAR 
leftin ? dummy 0 
op dummy1i 


r il m? dummy2 
bottomin ? dummy3 


leftout ! sender0 
topout ! sendero 
rightout !  sender2 


bottomout ! sender3 
SEQ r = O FOR size 


SEO 
square[r] [0] :- dummy0 [r] 
square[size-l] [r] := dummy3 [r] 
square[r] [size-l] := dummy2 [r] 
SEO r = 1 FOR size 2 
SEO € = 1 OR SE 2 
SEQ 
tp:= ((w * square[r] [e] wan 
square [r] (c `n EL 
( square [r] [C cM 
square [| 6-1) se) 2 
square [r * 1] [cC] ) F 
(4 + w) 
calcul [r] Ic] Em 
square: calcu 
SEQ r = O FOR size 
SEQ 


senderO [r] square [r] [1] 
sender2 [r] sguare[r][si2Zce >g 
sender3 [r]:= square[size-2/ EM 


170 


MEG 


SEQ 
Tel ado Y 
I ac W 
TE 
tag- s 
SEQ 
AE] FALSE 
peoescus square 
TROUE 
SEQ 
PAN 
leftin ? dummv0 
acum ? dummyi 


men ? cummy 2 
bottomin ?  dummy3 


HEFEOUT ! senderO 
report ! senderl 
rightout !  sender2 


bottomout ! sender2 
SEQ r = 0 FOR size 
SEQ 
sguare[r] [0] dummy0 [r] 
sguare[0] [r] dummyl [r] 
square[r] [s1ze-1] := dummy2 [r] 
SBO r = 1 FOR size = 2 
SEQ c = 1 FOR size - 2 


SEQ 
tp: ((w * square[r][c] ) + ( 
Saro r I er] + 
OK (arc Ed (Ee 1] + ( 
square IE -i] [c] + 
Suare es I] [Cc] )))) 4 
(4 + w) 
sales - CD 
square :- calcul 
sO r =O FOR size 
SEQ 


senderO [r]: 
sender) (ri: 
sender? [r]: 


square [El 1] 
square 1] [r] 
square [r][size-2] 


Unt 


na 9 


SEQ 
leftin ? tag;w 
TE 
tags S 
SEO 
active:- FALSE 
Eopour square 
bottomin ? temporal 
topout ! temporal 
TRUE 
SEQ 
PAR 
leftin ? dummy 0 
topin? dummy 1 


Bight ine? dummy 2 
bottomin ? dummy3 


leftout ! sendero 
topour |! senderl 
rightout !  senderO 


bottomout ! sender3 
SEO H MO FORO same 


SEO 
square[size-1][r]:- dummy3 [r] 
square[0] [r] :- dummyl [r] 
sguare[r] [0] :- dummy0 [r] 
SEQ r >= 1 FOR s4720. 2 


SEO TC s 1 FOR 22 
SEQ 
tp:= ((w * square[r][c]i) WT 
square [E] TEAM 
( sguare [r] lc Ph 
square [ris 
square [E + l] pp ep 
(4 + w) 
calcul [lt] ei y 
square :— calcu' 
SEO TE 0 FOR sae 
SEG 
senderO [r]: 
senderl [r]: 
sender3 [r]: 


square IE Ia 
square [1 BM 
square [size-2 5 


illa? 


PROC transp.horizontal (VAL INT engine, CHAN OF ANY 

ean toe rrghtsn,bottomin, 
Ne Front Copout, rigAtout, 
EOtEGMOUE 

Use) cc: 'Edsiol ib NUSerio.tsrU"k 

BOCL active: 

VAL S ES 11: 

VAL g IS 333: 

VAL size IS 8: 

mI sag wn: 

Ms ze] INTESpec!: 

[size] INT spec2: 


WHILE TRUE 
SEO 
SEQ r = 0 FOR size 
SEQ 
specl [r] := 0 
SpPecz [r] 7; = O 


n:= engine 
active:= TRUE 


tag:- g 
WHILE active 
SEO 
ER 
n= 12 
SEO 
leftin ? tag 
bottomout ! tag 
IF 
tag= s 
active:= FALSE 
TRUE 
SEO 
leftin ? speci 
rightin? spec2 
leftout ! spec2 
Eagmerour 1 speel 
= 13 
SEQ 
ione Has 
bottomout ! tag 
TE 
oM 's 
active:= FALSE 
TRUE 
SEO 


ME e speel 
rightin? spec2 
leftout ! spec2 
rightout ! specl 


UN 


= 14 
SEQ 
topin™ stag 
dE 
tag = s 
active:= FALSE 
TRUE 
SEQ 
leftin ? speci 
rigntin?: Spec? 
leftout ! spec2 
rightout ! specl 


w < mm vm O t w emm mm NS YS a N s . an "ca emm emm an an an an emm an o oeb om s "CT  . mmm mm emm . om — emm a a 


PROC transp.vertical (VAL INT engine, CHAN OF ANY 
leftin,topin,rightin , bo 
leftout, topout ,rightout, bot tomoda 

#USE `“osvrdsiolibmiserF ie Ha 
BOOL active: 

VAL S [IS 11: 

VAL d ES 393: 

VAL Size IS 8: 

INT a 5 mm: 

[size] INT Specl: 

[size] INT spec2: 


WHILE TRUE 
SEQ 
SEQ r = 0 FOR Size 
SEQ 
Speech” Ei]: 
Spec2 [r] := 0 


n:= engine 
active:= TRUE 
tag:= g 
WHILE active 
SEQ 
IE 
n= 3 
SEQ 
bottomin 2 tag:w spedll 
topout ! tag vV spel 
rrgntourc vog 
TE 
tag- S 
active:= FALSE 
TRUE 
SEQ 
bottomin ? specl 
topout ! speci 
EOp EN sees 
bottomout ! spec? 
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ns 7 
SEO 
diete id Egg 
Leu ! tag 


1012 
tag = s 
active:= FALSE 
TRUE 
SEO 
Icey spec] 
DoC omin? spec2 
COPOU | Spec2 
EE oe Ee 
= 11 
SEQ 


letsel. tag 
LIGAL: as 


DE 
Cage 
active:= FALSE 
TRUE 
SEQ 
ioannes pec 
BEEN BEE 
topout ! spec2 
bottomout ! specl 
PROC neutral.node ( CHAN OF ANY 


keftin,topin,rightin, bottomin, 
juu Dout, FAN EOUL:, bottomeout ) 
ISI AE AVE AsSiolibNaserio. tse": 
BOOL active: 
vas IS 11: 
Ms ol Ts 333: 
INT tag: 
WHILE TRUE 
SEQ 

active:= TRUE 

ele MG] 

WHILE active 

SEQ 
leftin ? tag 
IF 
tag- S 
active:- FALSE 
TRUE 
SEQ 
SKIP 


ME 


— N emm emm emm emm emm emm zm a EO emm emm emm emm emm Y “Y emm S emm YS emm emm emm emm emm emm emm emm emm emm zm N < emm mmm m mmm <  -. . emm — Sd > — 


PLACED PAR 
PROCESSOR O T4 

PLACE channel[0] AT linkOin: 

PLACE channel[1] AT linklin: 

PLACE channel[2] AT link2in: 

PLACE channel[3] AT link3in: 

PLACE antichannel[0] AT link0out: 

PLACE antichannel[l] AT linklout: 

PLACE antichannel[2] AT link2out: 

PLACE antichannel[3] AT link3ouc: 

corner.node(0,channel1[0] ,channel1 [Tj channel NP 
channel[3] ,antichannel[0],antichannel [1 
antichannel[2],antichannel[3]) 


PROCESSOR 8 T4 

PLACE channel[5] AT linkOin: 

PLACE channel[7] AT linklin: 

PLACE channel[8] AT link2in: 

PLACE channel[9] AT link3in: 

PLACE antichannel(5] Ar koel: 

PLACE antichannel[7] AT linkiout: 

PLACE antichannel[8] AT link2out: 

PLACE antichannel[9] AT link3out: 

corner.node(8,channel[5],channel[7],channel[8], 
channel[9],antichannel[5], 
antichannel[7],ancicnannel[oôl, 
antichannel[9]) 


PROCESSOR 2 T4 

PLACE channel[17] AT linkOin: 

PLACE channel[12] AT linklin: 

PLACE channel[18] AT link2in: 

PLACE channel[19] AT link3in: 

PLACE antichannel[17] AT linkOout: 

PEACE antichannel[12]| AT ink on - 

PLACE antichannel[18] AT link2out: 

PLACE antichannel[19] AT link3out: 

corner .node(2,channel[17],channel[12],channel[18], 
channel[19],antichannel[17], 
antichannel([12].,antichannelf Ha. 
antichannel[19]) 
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a DE N QGBƏS ÑaÑa 8Ə TÉ tb 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


PROCESSOR 10 T4 


AT linkOin: 
BR linkin: 
channel[22] AT link2in: 
channel[23] AT link3in: 
antichannel[20] AT linkOout: 
antichannel[16] AT linklout: 
antichannel[22] AT link2out: 
antichannel[23] AT link3out: 


channel [ 20 ] 
channel[16] 


corner.node(10,channel[20],channel[16],channel[22], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Cross 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
EFOSS 


channel[23],antichannel[20],antichannel[16], 
antichannel[22] ,antichannel[23]) 


PROCESSOR 1 T4 


channel[10] AT linklout: 
channel[3] AT link2out: 
channel[11] AT link3out: 
channel[12] AT linkOout: 
antichannel[10] AT linklin: 
antichannel[3] AT link2in: 
antichannel[11] AT link3in: 
antichannel[12] AT link0in: 


.node(l,antichannel[10],antichannel[3], 


antichannel[11],antichannel[12], 
channel[10],channel[3], 
channel[11],channel[12]) 


PROCESSOR 9 T4 


channel[13] AT linkiout: 
channel[9] AT link2out: 
chamnel[15] AT link3out: 
channel[16] AT linkOout: 
antichannel[13] AT linklin: 
ami chanmell9] AT link2in: 
antichannel[15] AT link3in: 
antichannel(16] AT link0in: 


.node(9,antichannel[13],antichannel[9], 


am cana dIE N antiëhannei [16], 
channel[13],channel[9], 
channel[15],channel[16]) 


IE 7 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


PROCESSOR TEN 


cnannel [24] AT Hinog: 
antichannel [307 AT ink Er 
channel[19] AT link2out: 
Channel [25] AT Mink seur 
antichannel[24] AT linkOin: 
channel[30] AT linklin: 
antichannel[19] AT link2in: 
antichannel[25] AT link3in: 


transp.vertical(3,channel[30],antichannel[19], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


antichannel[25],antichannel[24], 
antichannel[30],channel[19], 
channel[25],channel[24]) 


PROCESSOR IEA 


antichannel[7] AT linkOin: 
channel[26] AT linklin: 
anticnannel[23] AT #imk Ir 
antichannel[29] AT link3in: 
channel[7] AT link0out: 
antichannel( 26] AT nk ean 
cnannel [235 WinkZour: 
channel[29] AT link3out: 


transp.vertical(11,channel[26] ancicnanne lee 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


antichannel[29],antichannel[7], 
antichannel[26],channel[23], 
channel[29],channel[7]) 


PROCESSOR 5 T4 


channel[11] AT link2in: 
channel[6] AT link3in: 
channel(13] AT lanço: 
channe1 [14] AT linklin: 
antichannel[11] AT link2out: 
antichannelpej] A TAK oue 
antichannel [13] AE uM 
antichannel[14] AT linklout: 


central.node(5,channel[11],channel[6],channel[13], 
channel[14],antichannel[11],antichannel[6], 
antichannel[13],antichannel[14]) 


IRES 


PROCESSOR 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


13 T4 

hannel[10] AT linkO0in: 
antichannel[28] AT linklin: 
eme MESS AT link2in: 
channel 27] “n Link3in: 
antichannel[10] AT link0out: 
channel [28] Ar linklout: 
ante nea nel AT 'Tink2out: 
ant ichannel|27] AT link3out: 


tren EE Her son al l) channel[15] channel[27], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


channel[10],antichannel[28],antichannel[15], 
antichannel[27],antichannel[10], 
channel[28]) 


PROCESSOR 7 T4 


ansgicnannetl lanko0in: 
Channel rd T AT linklin: 
SEID LT TLE 1.0 2 
channel 21I AT link?in: 
channel[26] AT linkOout: 
antichannel[4] AT linklout: 
antichannel[25] AT link2out: 
a'm EE 


Eang er icaló7 channel[25], ,channel[21], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


antichannel[26], 
channel[4],antichannel[25], 
antichannel[21],channel[26], 
antichannel[4]) 


PROCESSOR 15 T4 


channel[30] 
channel[ 32] 


AT linkOout: 

AT ik lout: 
channel[29] AT link2in: 
Channel | An On 

ci i Han mc a A i'n Kk O a 
antichannel[ 32] AT linklin: 
antichannel[29] AT link2out: 
Si ichannel[31] AT Link3out: 


neutral.node(channel[29],channel[31],channel[30], 


channel[32],antichannel[29],antichannel[31], 
am cane 50] antichannel[32]) 
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PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Cross 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Gross 


PROCESSOR 4 T4 


channel[2] AT link3out: 
channel[4] AT linkoout: 
channel[5] AT TITAK leur: 
channel[6] AT link2out: 
antichannel 2) AT rm bm 
antichannel[4] AT linkOin: 
antichannel 5 AT NIS NE 
antichannel[6] AT link2in: 


.node(4,antichannel[2],anticnannelr[4 1 


antichannel[5],antichannel[6],channel[2], 
channel[4],channel[5],channel[6]) 


PROCESSOR 6 T4 


channel[18] AT link3out: 
channel[14] AT linkOout: 
channel[20] AT linklout: 
channel[21] AT link2out: 
antichannel[18] AT link3in: 
antichannel[14] AT linkOin: 
antichannel[20] AT linkin 
antichannelf21] AT link mn: 


.node(6,antichannel[18],antichannel[14], 


antichannel[20],antichannel[21],channel[18], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


channel[14],channel[20],channel[21]) 


PROCESSOR 12 T4 


channel(32] AT link0in: 
anticnanne! [O AT nk Mm: 
antichannel[2 awe 
antichannel[S8| AT lanka 
antichannel[352] Au ank 
channel p0] aT melo EE 
channel[27] AT Ink ou 
channel [8] Ar Tink out: 


transp.horizontal(12,antichannel[8),channel Sons 


antichannel(0],antichannell 27] ,channel as 


antichannel[32],channel[0],channel[27]) 
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PROCESSOR 14 T4 

PLACE channel[28] AT linkOin: 

PLACE antichannel[17] AT linklin: 

PLACE antichannel[31] AT link2in: 

PLACE antichannel[22] AT link3in: 

PLACE antichannel[28] AT linkOout: 

PLACE channel[17] AT linklout: 

PLACE channel[31] AT link2out: 

PLACE channel[22] AT link3out: 

transp.horizontal(14,antichannel[22],channel[28], 
antichannel[17],antichannel[ 31], 

channel[22],antichannel[28],channel[17], 

channel[31]) 


o'n 


APPENDIX ID 


16 TRANSPUTER NETWORK SOURCE CODE 


PROC input .handler 


-- This 


processors FU 
-- on the network and display the information coming from 


the 


network 
-- when it stops the network. 


— — — — — — o o o o TO o o e TO o NS I N Q e emm emm O “m w mm emm om Q uo. emm V Q p ub “nT w h? ? n “(s a w NS o mmm mmm mm wm s m emm — 


#USE 


VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 


CHAN OF ANY leftin 
PLACE 
PLACE 
PLACE 
PLACE 


VAL 


(CHAN OF ANY keyboard,screen) 


procedure 
and 3 


send the boundary conditions 


-- Channel and link decla. 


“eN tdsiolib serie ts e 


Vink Oe 
Vie LOUE 
imi 2 Cue 
link JOUE 
IRA 
AA 
Tynk22n 
ar ke Sr 


s IS 11: 


IS 
T. 


VAL esc IS 223: 
VAR EA 157335. 
VAL size IS 6: 


[size] 
[size] 
[size] 
[size] 
[size] 


INT temp: 
NS sp: 
THE weep: 
ENT see. 
[size] ENT Eb E 


p 


-] O Ul > C) ho P 


,rightout,antirightout,antileftin: 


Ter tin AT Linkin: 
rightour FAT ins ob: 
antirigntout AT lin cr: -- external channels 
antileftin AT link2in: 


-- placement of 


-- Array declarations 


[16][size] [size] INT true: 
BOOL turning: 
INT w, tag,he,no,z,counter, counter s 


SEQ 
Mo 20 
write.full.string (screen, " Enter the hot end 


newline(screen) 


NO: z0 
write.full.string (screen, " Enter the propagation 


temperature") 
read.echo.int (keyboard,screen,he,no) 


rate ") 


read.echo.int (keyboard, screen,w,no) 


newline(screen) 
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SEO 
SEQ r - 0 FOR size sent one on OF 
SEQ dS 
temp [r] :- 
recp elk = 
recpu [2] 
CECI s 
SEQ r = 0 FOR siz 
temp [r] := he 


0 
0 
e 


tag:= g 
antirightout ! tag;w;temp -- sending hot end and W 
PIC AAO wee eine -- and start signal 


antirightout ! recp2 
righ out=! recpl 
turning :- TRUE 
SEQ 
WHILE turning 
PRI ALT 
keyboard ? z -- receive stop signal 
SEQ 
IF 
Z - esc 
SEQ 
SEQ 
tag:= s 
antileftin ? recp 
leftin ? recp 
antirightout ! tag;w;temp 
rightout ! tag;w;temp 


counter := 0 

counter x O 

WHILE counter < 16 -- receiving 
SEQ es 


Ann? truly 
SEO ho = O HOË size 
SEQ p = O FOR size 
mane counter] [h] [p] := 
truly [h] [p] 


counter := counter + 1 
SEO 
SEQ r = 0 FOR size - 1 
SEO 
SEQ C = 0 FOR size - l 
SEQ 


txt:= true [counterl] 
[r] [c] 
write.int (screen,txt,3) 
SEQ I = 1 FOR size - 2 
SEQ 
txt:= true [counterl + 
E 


write.int (screen,txt,3) 


E2 3 


SEQ f = 1 FOR size =" 2 
SEQ 
txt:= true [counterl + 
ST TM] 
write.int (screen, ts 
SEQ d = 1 FOR size - 1 
SEO 
txt:= true [counterl*tl2] 
[r] (es 
write.int (Screen, txts 
newline(screen) 


counterl:= counter 
SEQ r = 1 FOE Size co 
SEO 
SEQ c 2-0 FOR size - l 
SEO 


txt:= true [counteu Wi 
[r] Tel 
write.int (screen, txt ED 
SEQ 1 = 1 FOR size - 2 
SEQ 
txt:- true[counterl+4] 
dän. 
write.int (screen, txts 
SEO f = 1 FOR size ma 
SEQ 
txt:= true [counter 
8] [r] g 
write.int (screen,txt,3) 
SEQ d = 1 FOR size =mi 
SEO 
txt:= true [counterl+12] 
[r][d] 
write.int (screen,txt,3) 
newline(screen) 


counterl:= counterl + 1 
SEQ r = 1 FOR size - 2 
SEQ 
SEQ c = 0 FOR Size 
SEO 


txt: true [counter 
Ra (re) 
write.int (screen, txts 
SEQ l = 1 FOR size - 2 
SEO 
txt:= true [counterl eg 
1] [aaa 
write. int (screen, txt Has 
SEO f = 1 FOR size zz 
SEO 
txt: true [counters 


8] [r] [£f] 
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velte dnt (seteen CXU, 3) 
SEQ d = 1 FOR size - 1 
SEO 
eser serlesskeounter| + 
12][r][d] 
write.int (screen,txt,3) 
newline(screen) 


counterl:= counterl + 1 
SEQ r = 1 FOR size - 1 
SEO 
SEQ c = 0 FOR size - 1 
SEQ 


cp ere Omer | 
pe] (61 
write.int (screen,txt,3) 
SEO 1 = 1 FOR size - 2 
SEO 
txt:= true [counterl + 
DEMAS 
write.int (screen,txt,3) 
SEQ f = 1 FOR size - 2 
SEQ 
txt:= true[counterl+8] 
Re ae] 
write.int (screen,txt,3) 
SEO d = IT FOR size - 1 
SEO 
Et: “Le counter! +12] 
E 
write.int (screen,txt,3) 
newline(screen) 
turning := FALSE 
newline(screen) 
antileftin ? recpl 
SEQ 
leftin ? recp2 
antcirigntcout tad NE» 
rightout ! tag;w;temp 
anpsrgneoutec resp» 
Mc ro || EP 
newline(screen) 
wa idee -Ering (screen, “Type ANY to return to TDS") 
TM any: 
read.char(keyboard, any) 
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— VAL 1:000 O 
VAL Link louti Ro ore 
VAL -TINkZOUE DIS 
VAL». Link Scheren: 
VAL ink im TS de 
VATI mk TS 5: 
VAR link2in IS 6 
VAR ne sem [SN 
i33] CHAN OF ANY channel,antichannel: 


— Y Í emm emm 9 k k mi a ee W a u c emm mmm mi I H ae emm e emm emm emm cee emm emm emm emm W emm emm emm emm emm s“ m emm k 


PROC central.node(VAL INT engine, CHAN OF ANY 
leftin,topin,Fiahtin, boeke mm 
leftout ,topout, rightout, Pporteene 


-- This procedure does the calculations for nodes at the 
center -- of the network 


— Y ===> === s s == SN WEE p s p mk == == => I ===> ===> ===> ===> ===> ===> SN => ===> => ===> ===> ===> ===> => == ===> ===> emm == == == === O — e 


—----- VALIS PTS” lD: 
VAL MESS 33: 
VAL size IS 6: 
[size] [size] INT square: 
[size] [size] INT calcul: 
[Size] INT dummyO: 
[size] INT dummyl: 
[size] INT dummy2: 
[size] INT dummy3: 
[Size] INT dummy4: 
[size] INT sendero: 
[size] INT senderl: 
[size] INT sender2: 
[size] INT sender3: 
[size] [size] INT temporal: 
BOOL active 
INP Cag w, CPA: 
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WHILE TRUE 


SEQ 
SEQ r= 0 FOR size meiau SS 
seel. 0 FOR Size 
SEQ 
square Ele) -= 0 
calcule] -= 8 
temporal Ape. :="0 
SHO r= 70 FOR Size 
SEQ 
dummyO [r] := 0 
dummyl [r] := 0 
als oe] O 
dummy3 [r] := O 
dummy4 [r] := O 
sender0 [r] := O 
senderl [r] :- O 
sender2 [r] :- O 
sender3 [r] :- O 
active:- TRUE 
n:- engine 
WHILE active 
SEQ 
LE 
(n= 5) OR (ns 9)-- code for processors 5 and 9 
SEQ 
leftin ? tag;w --receiving start/stop 
rightout ! tag;w -- sending start/stop 
TE 
tag= s 
SEQ 
active:= FALSE -- checking for stop 
topout ! square -- routing code to 
bottomin ? temporal 
topout ! temporal 
bottomin ? temporal 
topout ! temporal 
TRUE 
SEQ -- Communications receive 
PAR -- send boundaries 
-- Conditions 
def > dummy0 
LEEN ? dummy 1 


EECH, dummy?2 
bottomin ? dummy3 


leftout ! sendero 
Lopour | senderl 
rightout ! sender2 


bottomout ! sender3 


W^ 


SEO FE soe IZE 
SEO 
squares d 
square [sm 


dummyl [r] 
dummyO [r] 
square[r] [size - 1] :— dun — 
square(size - 1] [r] := dumm; MEM 
SEQ r = 1 FOR size zZ 
SEQ C = I FOR SSE 
SEO 
tp:= ((w * square [r][c] EM 
square E 
( square [r] [C * 1] * ( 
square pr er + 
square [r + 1] te) JM 


(4 + W) 
calcul [£1 [<W “ED 
square EE calc 
SEO r = 0 FOR Size 


SEQ 
sender0 [r]: 
senaerl [r]: 


square [Fpp 
square [1] pa 
sender2 [r]:- sguare[r][size WW å 
sender3 [r]:= square[size-2] [Fr] 
(n= 6) OR (n= 10) -- code processors 6 and 10 
SEO -- in the network 
leftin ? tag;w 
rightout ! tag;w 


TE 
tag= s 
SEO -- checking stop 
active:= FALSE 
topout ! square -- routing code 
bottomin ? temporal 
topout ! temporal 
TRUE 
SEQ 
PAR == COMMUNICATIONS BLOCK 
leftin ? dummy0 
LOD ine dummy 1 


rightin? dummy 2 
bottomin ? dummy3 


dedu: sendero 
topout ! senderl 
rigntout ' sender” 


bottomout ! sender3 
SEQ r = 0 FOR size 
SEO 
square[0] [r] :s dummy id 
square[r] [0] := dummy0 [Fr] 
SOIT og dummy2 [r] 
square[size -1][r] dummy3 pa 
SEQ r = 1 FOR size - 2 


l H 
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EO C = 1 FOR size - 2 
SEO 
Dee wesseWlerer[r|[e]| ) + ( 
S nce ape | 
Scar en tE [EE Li # ( 
see EI te] + 
Seguerra + (El ON) / 
(4 + w) 
eodeni wm el tp 
square := calcul 
SEQ r = 0 FOR size 
SEQ 
sender0 [r 
sender1 [r 
sender2 [r 
sender3 [r 


Seu crei mu Å 
Square ft |) Lr | 
sen el size 2] 
square[size- 2][r] 


IS 
Is 
(E 
IE: 


PROC corner.node(VAL INT engine, CHAN OF ANY 
ME EE ELgatin Dot tomin, 
lié soutsuur, ridlitout, DOttomout) 


-- This procedure drives the execution of the processors 
at the corners 
-- of the array 


AS GC. t0s lol Tb userio.tsri: 


ES ITS 11: 

KAL G IS 333: 

VAL size IS 6: 

[Size] [size] INT square: 
Tie MIS ie INT- calcul: 
[size] [size] INT temporal: 
Men Ze INT cummy0o: 

[size] INT dummyi: 

sie AN mm 2 : 

ES ZEN dummys3: 

EZE INT dummy4 : 

[size] INT sendero: 
[size] INT sender): 
[Size] INT sender2: 
[Size] INT sender3: 

BOOL active 

mild ea wel n counter0: 
WHILE TRUE 
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SEO 
SEQ r= 0 FOR size -- Initializationtoei ir å 
SEQ c= 0 FOR size 
SEO 
sela kea ae 
Calcule | Ken 
temporal [r] [c] 
SEQ r= 0 FOR size 
SEQ 
dummyO [r] 
dummyl [r] 
dummy2 [r] 
dummy3 [r] 
dummy4 [r] 
senderO [r] 
senderl [r] 
sender2 [r] 
sender3 [r] 
active:= TRUE 
n:= engine 
WHILE active 
SEQ 
TE 
ns 0 -- code for processor 0 
SEO 
topin > tag. vw dim I 
EIgntouts N tog w 
bottomout | tag VW aummy l 
TE 
tag= s 
SEO -- checking for stop 
counter0:- O 
active:- FALSE 
topout ! sguare 
WHILE counterO < 3 
SEO -- screen array information 
bottomin ? temporal 
topout ! temporal 


loo 


H HHH H 
OO OO OC 


OD O OI 


counter0 :- counter0 + l 
WHILE countero < 15 
SEO 


rightin ? tempora 
topout ! temporal 


counter0 :- counter0 + 1 
TRUE 
SEO 
PAR 
ieftin 2 dummy0 
O 2 dummy 4 


rigen dummy2 
Dottomin 2 am 
leftout ! senderO 
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n= 


SEQ 


SEQ 


EOpout  ! senderl 

rightout ! sender2 

bottomout ! sender3 
SEQ r = 0 FOR size 


SEQ 
Ssquare[r] [0] := dummyl [r] 
sguare[r] [size - 1] := dummy2[r] 
Square[size - 1] [r] := dummy3[r] 
SEQr = 1 FOR size - 2 
SEQ c = 1 FOR size - 2 
EP ms square [r] [fè] ) +i 


Sq ME] Tc 1] + 

( Square [r] [C + 1] + ( 
stare 1] Ee] CH 
sade iz eq NN) 7 
(4 + w) 

cul rr] [e]: tp 
SEQ FOR size 
ealeul Tr) $0]: square[r] [0] 


Square :- calcul 
SEO r = O FOR size 
SEQ 


senderO [r]: 
senderl [r]: 
sender2 [r]: 
sender3 [r]: 


Square [Jal 
Sene IE [EI] 
square[r][size-2] 
square[size-2][r] 


-- code for processor 3 


bottomin ? tag;w;dummy3 
topout ! tag;w;dummy3 
lO OU vw 


TE 


tad- s 
SEO 
active:- FALSE 
topout ! square 
TRUE 
SEO 
PAR 
leftin ? dummy0 
copin ? dummy 1 


reden > dummy 2 
bottomin ? dummy4 


ER eE ! Sender? 
Lopout ! sender1 
rightout !  sender2 


bottomout ! senderl 


SEQ r = O FOR size 


JS 


SEO 
square[r] [0] dummy3 [r] 
square[0] [r] dummyl [r] 
sguare[r] [size -1]:= dummy2 [4] 
SEQ r = l FOR sizé w 
SEQ C = 1 FOR Size - 2 
SEQ 
tp:= ((w * sguare[r][c]) + ( 
square [r] [c-1] + 
( square [r] [C r 1 
square [r= nme 
square [r + 1] [c] nn 
(4 + w) 
calcul [ri cine 
SEQ r = O FOR size 


calcul rr] [0] := square [ra 
square := calcul 
SEO r = O FOR size 

SEQ 


senderl [r] 
sender2 [r] 


square] E 
square[r][sSizZze-s 


magi? -- code for processor 12 
SEQ 
e'n ag: 
TE 
tag= s 
SEO 
counter0 := 0 


active:= FALSE 
leftout ! square 
WELLE Councero 3 
SEQ 
bottomin ? temporal 
leftout ! temporal 


counter0:= counter0 + l 
TRUE 
SEQ 
PAR 
Left timer dummy 0 
COpi nee dummy 1 


rigntin 2 dummy?2 
bottomin ?  dummy3 


leftout ! senderO 
topout ! sender3 
rightout !  senderO 


bottomout ! sender3 


J 52 


SEO E = 0 FOR Size 


SEQ 
square[r] [0] := dummyO [r] 
sguare[size - l][r]:- dummy3 [r] 


SEQ r = 1 FOR size - 2 
SEQ c = 1 FOR size - 2 


SEO 
Epi EN w Square [E ej). + { 
Square Brinca e 
( square [r] [c + 1] + ( 
square [r-1] [cl + 
square [rr 1] [c] ))))/ 
(4 + w) 
salie cH - “Cp 
square := calcul 
SEQ r = 0 FOR size 
SEQ 
senderO [r] :=square [r] [l] 
Sender3 [r] :=square [size-2][r] 
n= 15 -- code for processor 15 
SEQ 
leftin ? tag;w 
IF 
tags s 
SEO 
active:= FALSE 
topout ! square 
TRUE 
SEO 
PAR 
leftin ? dummy O 
Ce 2 dummyl 


rigntin ? dummy 2 
bottomin ? dummy3 


lertout + senderO 
dte out `! senderl 
rightout !  senderO 


bottomout ! senderil 
SEO r = 0 FOR size 
SEO 


square[0) [Fr] := dummyl [r] 
ense ao) = dimmyO [r] 
SEQ r = 1 FOR Size - 2 


SEQ c = 1 FOR size - 2 
SEQ 
tp:= ((w * square [r][c] ) + ( 
sets lc Y] +H 
ha EC 1] + ( 
square [r-1] [c] + 
Square JJ [C] )))) / 


(4 + w) 


calcul [rj] [c J Km 
square := calcul 
SEQ fr = 0 FOR size 


SEO 
senderO [r] 
senderl [r] 


square sl 
square Tm 


1] 
r] 


PROC cross.node(VAL INT engine, CHAN OF ANY 
leftin,topin, rightin sod 
leftout ,topout,ridghteut. bote 


-- This procedure drives the processors which are 
situated 

-- forming a 

-- croos at the square network 


VAL S IS «L: 

VAL GS 3533: 

VAL Size IS 6: 

[Size] [size] Ni Seme. 

[Size] [size] INT calcul: 

[size] [size] INT temporal: 

[size] INT dummyO: 

[size] INT dummyl: 

[size] INT dummy2: 

[Size] INT dummy3: 

[Size] INT dummy4: 

[size] INT sendero: 

[size] INT senderl: 

[size] INT sender2: 

[size] INT sender3: 

BOOL active 

INT tag,w,.cp,n,eounce ae 

WHILE TRUE 

SEO 
SEO r= 0 FOR size -- Initialization ofarray 
SEO Cm O FOR Sage 
SEQ 
square m 
calcul [r] eg 
temporal EE 
SEO teo FORM qe 


loo 


SEO 
dis Ee O 
dummyl [r] :- O 
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dummy2 [r] 
dummy3 [r] 
dummy4 [r] 
senderO [r] 
senderl [r] 
sender2 [r] 
sender3 [r] 
active:= TRUE 
n:- engine 
WHILE active 
SEQ 
LE 
n= 1 -- code for processor C" 
SEQ 
topin ? tag;w;dummyl 
slet ou Gad. w 
BRET -- sending start/stop signal 
tag- s 
SEO -- checking for stop 
active:= FALSE 
topout ! square -- routing code 
bottomin ? temporal 
topout ! temporal 
bottomin ? temporal 
topout ! temporal 


H W I 
O 


l n Ww I 
OO OO 


TRUE 
SEO 
PAR 
leftin ? dummy 0 
topine? dummy 4 


Blencın ? dummy2 
bottomin ? dummy3 


leftout ! sender2 
Copo! senderl 
rightout ! sender2 


bottomout ! sender3 
SEQ r = O FOR size 


SEQ 
square[r] [0] := dummyl [r] 
sguare[r] [size -l]:= dummy2 [r] 
Square[size - l][r]:= dummy3 [r] 
square[0] [r] :» dummy4 [r] 

SEQ r = 1 FOR size - 2 

SEQ c = 1 FOR size - 2 

SEQ 
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tp: ((w * square [r][C] ta 
square es MA 
( square [r] [C + 1] + ( 
Square ee. iN. 
square [r * 1] [c] )))) / 
(4 + w) 
calcul [©] (eds 
SEQ r = O POR FS EE 
calcul [r] [0]:- sguare[r] (GN 


sguare :- calcul 
SEO r = G FORM Ee 
SEQ 


senderl [r]: 
sender2 [r]: 
sender3 [r]: 


square[1] [r] 
sguare[r][siZeg 
square[size-2] [r] 


ns 2 -- code for processor 2 
SEQ 
bottomin EI I 
BLAINEQUE ad sw 


T 
tag= s 
SEQ 
active:= FALSE 
topout ! square 
bottomin ? temporal 
topout I temporal 
TRÜE 
SEG 
PAR 
leftin ? dummy0 
CODINA? dummy 4 


EG EIN o dummy?2 
bottomin ? dummy3 


leftout ! sender2 
tepeuc,! senderl 
rightout !  sender2 


bottomout ! sender3 
SEQ r = 0 FOR size 
SEQ 
square[r] [0] := dumnayi q 
sguare[r] [size-l] :- dummy2 [r] 
sguare[size - l][r] := dummy kg 
square[0] [r] := dummy4 [r] 
SEQ r = 1 FOR Sizer = 
SEO c = 1 FOR size 2 
SEQ 
tp:= ((w * square [r][c]) + ( 
square [IAE IS 
( sguare [r] [cC FV 
square [r i m 
square [r + 1] [c] 'O' mi 
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(4 + W) 


calce] 22 tp 
SEQ r = O FOR size 
carcul [r] [0]:= square|ri [9] 
square := calcul 
SHO © = O FOR size 


SEQ 
senderl [r]: 
sender2 [r]: 
sender3 [r]: 


squaere[r] Tr] 
square[r][size- 2] 
Square [Sue 2 | 


n= 4 -- code for processor 4 
SEQ 
lertine EC 
preco ed ; W 


Naka 
tags Ss 
SEQ 
counter) := O 
active:= FALSE 
leftout ? square 
WHILE counterl < 3 
SEQ 
bottomin ? temporal 
leftout ! temporal 
Counter] :z counterl + 1 
WHILE counterl < 11 
SEO 
rightin ? temporal 
leftout ! temporal 
counterl := counterl + 1 
TRUE 
SEQ 
PAR 
Benin? dummyo 
orn dummy 1 


HEEN dummy?2 
bottomin ? dummy3 


leftout ! sendero 
topoue ! sendero 
rightout ! sender2 


boecomout ! sender3 


CEO EE.) FOR size 


SEQ 
sguare[r] [0] := dummy0 [r] 
square[size -l][r] := dummy3 [r] 
sguare[r] [size-l] :- dummy2 [r] 
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1 FOR size - 2 


SO 1 FOR size - 2 
SEQ 
tp:= ((w * square[r] [c] — 
square Tre u | 
( Square [r] [c * E 
square EE 
square [r + 1] [ce] )))) / 
(4 + w) 
calcul [tr] fe] Er 
square :- calcul 
SEQ r = 0O FOR size 
SEQ 


senderO [r] 
sender2 [r] 
sender3 [r] 


square [rip 
square[r][size ag 
square[size-2][r] 


n= 8 -- code for processor 8 
SEQ 
leftin-? tag w 
righeout ! saad w 


TE 
tag= S 
SEO 
counterl := 0 
active:= FALSE 
leftout ? square 
WHILE counterl < 3 
SEQ 
portomin 2 temporal 
leftout ! temporal 
counterl := counterl + 1 
WHILE  counter1 « 7 
SEO 
racine mmc 
Jeftout ! temporal 
counterl := Counter! N 
TRUE 
SEQ 
PAR 
T dummy0 
COPIAN: dummy 1 


EOS dummy 2 
bottomin ? dummy3 


leftout ! sendero 
topout ! sendero 
rightout ! sender? 


bottomout ! sender3 
SEQ r = O FOR size 


SEQ 
square[r] [0] := dummyO [r] 
square[size - 1][r]:- dummy3 [r] 


SS 


square(r][size -1] :- dummy2 [r] 
SEQ r = 1 FOR size - 2 
SEQ C = 1 FOR size - 2 


SEO 
O MAS Square ape] ) + ( 
square [r] [cer] + 
(ME Uare INN er: 4 
Square MIE] s 
square Me) ier y 7 
(4 + w) 
Galeno fr) [Cc] := ep 
square := calcul 
SCO O FOR size 
SEQ 


senderO [r] 
sender2 [r] 
sender3 [r] 


Sguare|r | 1] 
SQUARE RAS iz in] 
Square srce Jr) 


W NON 


(n= 7) OR (n= 11) -- code processor 7 and 11 
SEO 
leftin ? tag;w 
EIGhrOut ss cag; W 
Se 
tags S 
SEQ 
active:= FALSE 
topout ! square 


TRUE 
SEQ 
PAR 
leftin 7 dummy0 
tepin ? dummy 1 


Enea dg 2 dummy 2 
bottomin ? dummy3 


leftout ! senderO 
ponour 1 senderl 
rightout ! sender2 


bottomout ! sender2 
SEO r = O FOR size 
SEQ 


Sguare[r] [0] "e dummy0 [r] 
sguare[0] [r] :- dummyl [r] 
sguare[r] [size- l]:- dummy2[r] 


SEQ r = 1 FOR size - 2 
SEQ C = 1 FOR size - 2 
SEQ 


tp:= ((w * square [r] [c])+ ( 
square [r Ir c mm 
( square [re] te PD eo 
square [ESE Fem, 


square [r + 1] [c] NN 
(4 + w) 
calcul [ei [fem EM 
Square := calcul 
SEO r = 0 FOR SIZE 


SEO 
sender0 [r] 
senderl [r] 
sender2 [r] 


square |r ERE 
square [1] [rj 
square[r | Tsmz =au 


n= 13 -- code for processor 13 
SEO 
leftin ? tag;w 
TE 
tags s 
SEO 
active:= FALSE 
topout ! square 
bottomin ? temporal 
topout ! temporal 
bottomin ? temporal 
copouc ENPE RI 


TRUE 
SEO 
PAR 
Left mn: dummy0 
topin >? dummy 1 


prODEJD 2 dummy 2 
Bot comin 2 dummy 


leftout ! sender0 
topout ! senderl 
rightout !  senderO 


bottomout ! sender3 
SEQ rr. MS FOR Size 


SEQ 
square[0] [r] :- dummyl Tri 
sguare[r] [0] :- dummyO [r] 
sguare[size - l][r]:- dummy3 [Tr] 


SEQ r = 1 FOK Size 77 
SEQ c = 1 FOR size - 2 
SEQ 

tp:= ((w * square [rj[c]) + ( 
square [r] [{c-l] + 
( square fr) [Cc + 13 + ( 
square [r-1] [c] + 
square [r + 1] [c] )])) 
(4 + w) 


ele ho] [e] -= tp 


square := calcul 
SEQ r = 0 FOR size 
SEQ 
sendero [r] := square [r] [1] 
senderl [r] := square [1] [r] 
sender3 [r] := square[size-2][r] 
naa -- code for processor 14 
SEQ 
Teftin.? kag:w 
TE 
tag= s 
SEQ 
active:= FALSE 
HERE square 
bottomin ? temporal 
topout ! temporal 
TRUE 
SEO 
PAR 
leftin ? dummyo 
el EE dummyl 


again dummy2 
bottomin ?  dummy3 


leftout ! senderO 
I O senderl 
rightout ! sender0 


bottomout ! sender3 
SEQ © = 0 FOR size 
SEQ 
square[0] [r] dummyl [r] 
sel e NONI dummyO [r] 
sguare[size - l][r]:- dummy3 [r] 
SEOST DOI FOR size - 2 
SEQ c = 1 FOR size - 2 
SEO 
tp:= ((w * square [r][c] ) + ( 
square [r] [c-1] + 
( square [r] [C * 1] * ( 
same MIE + 
square 11 [6] )))) 4 
(4 + w) 
calcul [ei [ce] :=+ tp 
Square := calcul 


SEQ r 20 FOR size 
SEQ 
senderO [r] 
senderl [r] 
S S De] 


square [r] [1] 
alr bL] [nr] 
SM | 2e e] 


lal 


— a a m NS Q emm om eee le ou Loo emm wm an om an an an an an an an an om an emm an an an an emm emm NS h i an an an a es es an an an 


ELACED (PAK 
PROCESSOR 0 T4 


PLACE 
PLACE 
EXA 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


AT ao an: 
AY tank] ime 
channel[2] AT link2in: 
channel [3] AT Ton sine 
antichannel[0] AT linkOout: 
antichannel[1I] AT Minki Es 
antichannel[2] AT Link2sue: 
antichannel[3] AT lini oue 


channel[0] 
channel[1] 


corner.node(0O,channel[0],channel[1],channel[2], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Cross 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
GEOSS 


channel(3],antichannel[0],antichannel[1], 
antichannel[27,/antichanne W jp 


PROCESSOR 6°14 


channel(5] AT linkOin: 
channel[7] AT linklin: 
channel[8] AT link2in: 
Cchanneki9] AT Hi ide 
antichannelpo | Ar IDEO 
ancicnannel[l |T sou 
anticenannel [3] AF meo 
antleEhanne la oue; 


.node(8,channel[5],channel[7],channel[8], 
channel[9],antichannel[5),antichannel a 


antichannel[8],antichannel[9]) 


PROCESSOR 2 TÉ 


channel[17] 
channel[l2] 


AT TnKO in: 
AT linklin: 
channel[ 18] AT link inm: 
channel[19] AT link3in: 
antichannelf17]) Mr |inkOEK 
antichannel (l2) AT Tink iout: 
anticnannel ps ATTIN o 
ant ichannel (19 Aru 


.node(2,channel[17],channel[12],channel[18] 
channel[19] , antichannelf[17], antichannme E 


antichannel[18] ,antichannel[19]) 
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PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


central.node(10,channel[20],channel[16],channel[22 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


Cross. 


PROCESSOR 10 T4 


AT linkin: 
AT linklin: 
channel[22] AT link2in: 
channel[23] AT link3in: 
antichannel[20] AT linkOout: 
antichannel p6] AT linklout: 
antichannel[22] AT link2out: 
ane chances ene eat 


channel [20] 
channel[16] 


] 
channel[23],antichannel[20],antichannel[16] 
] 


anerchennmelr227],antichannel[23 


PROCESSOR l1 T4 


C ne OT Linklout: 

Channel 3] AT link2out: 

channel 1E dT out: 

channel[12] AT linkOout: 
antichanmell1060]' AT linklin: 
antichannel[3] AT link2in: 
anerehannekel 1] AT lank Zin: 
antichannel[12] AT linkOin: 
node(1,antichannel[10],antichannel[3], 


antichannel[11], ancichannel[12],channe1l[10], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


channel[3],channel[11],channel[12]) 


EROCESSOR 9 14 


eden] AT linklout: 
channel[9] AT link2out: 
channel[15] AT Bk3out: 
channel[16] AT linkOout: 
antichannel[13] AT linklin: 
antichannel[9] AT link2in: 
antichannel[15] AT link3in: 
antichannel[16] AT linkOin: 


cm EU noder ancichannel[ 3] antichannel[9], 
antichannel[15],antichannel[16],channel[13], 


channel 9] channel[15] channel[16]) 
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PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


PROCESSOR 3 T4 


channel[24] AT linear 
antichannel[307 AT Angle 
channel[19] AT link2out: 
channel[25] AT link3out: 
antichannel [24 DAT nad 
channel[30] AT linklin: 
antichannel{ 19] AT Tink210F 
antichannel[25] AT lanka: 


corner.node(3,channel[30],antichannel[19], 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
CrOSS 


antichannel | 25], anutenante pase 
antichannel[30],channel[19], 
channel[25],channel[24]) 


PROCESSOR 11 T4 


antichannelf7] AT pi roO m. 
channel[26] 2 TIE 
antichannel[25] AT Lik nat 
antichannelf29 | T 
channel [7] AT Finse: 
antichannel[26] AT linkiout: 
cnannel [25] AT linkZouE: 
channel [297 AT Link cu: 


.node(11 channel(26] antichannel( 237, 


antichannel[29],antichannel[7], 


antichannel[26],channel[23],channel[29], 


PLACE 
PLACE 
PLACE 
PLACE 
PEACE 
PLACE 
PLACE 
PLACE 


central.node(5,channel[11],channel [6 | .channe eae 
channel[14],antichannel[11],antichannel[6], 


channel[7]) 


PROCESSOR 5 T4 


channel [ 11 l Arink? imn: 
channel[6] AT link3in: 
channel[13] AT linkOin: 
channel[14] AT linklin: 
antichannelfi ii] AT our 
antichannel[6] AT link3out: 
antichannel[ erte 
antichanneli ia] Appl in UL 


antichannel(13] antichannel[14]) 
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PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Cress 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
CEOSS 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 


PROCESSOR 13 T4 


channel[10] AT linkOin: 
antichannel[28] AT linklin: 
channel ls HE Yni 2n 
channel[27] AT link3in: 
antichannel[10] AT linkOout: 
channel[28] AT linklout: 
aneichannehplks] AT Link2out: 
ant ichanneo pia 7] AT Link3gout : 


maa ena 5] ehanneli 27] .channel [10], 


Ere Tallie || 26] antichannelT 15], 
antichannmelr27]J;,anticnannel[10], 
channel[28]) 


PROCESSOR 7 T4 


antichannel[26] AT linkOin: 
channel[4] AT linklin: 
Elanneli2s Ar Eink2in: 
channel[21] AT link3in: 
channel 26] AT Imnkoout.: 
antichannel[4] AT linklout: 
antichannel[25] AT link2out: 
art ichannel[21] AT link3out: 


.node(7,channel[25],channel[21],antichannel[26], 


channel[4],antichannel[25],antichannel[21], 
channel[26],antichannel[4]) 


EROCESSOR 15 T4 


channel [ 30] 
channel [ 32] 


AT linkOout: 

puli linkloul: 
channel[29] AT link2in: 
Channel M AT Link3in: 
#nEIEKannEI AS] AT linkO0in: 
am sac banner 32 ir inklin: 
a cnaanel 29 ANpunk2out: 
am ichammel[ 31] AT']1nk3out: 


corner.node(15,channel[29],channel[31],channel[ 30], 


channel[32],antichannel[ 29], 


antichannel[ 31] antichannel([ 30] antichannel[32]) 


PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
PLACE 
Cross 


PROCESSOR 4 T4 


channel [2] 
channel|[ 4] 


ANS uE: 
AT linkO0out: 
channel[5] AT linklout: 
channel[6] AT link2out: 
anerehannelı 217 Ar leink3in: 
antichannel[4] AT linkOin: 
antichannel[5] AT linklin: 
ancichannel[6) AT AMNk2in: 


mode 1 an cn annel] 21 ancichannel[a], 


antichannel[5],antichannel[6],channel[2], 


channel[4],channel[5],channel[6]) 
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PROCESSOR 6 T4 

PLACE channel[18] AT link3out: 

PLACE channel[14] AT linkOout: 

PLACE channel[20] AT linklout: 

PLACE channel[| 29] STR Pr. 

PLACE antichannel[18] AT link3in: 

PLACE antichannel[143 AT linx0in: 

PLACE antichannelp20] aT dangan 

PLACE antichannel[21] AT link2in: 

central.node(6,antichannel[18],antichannel[14], 
antichannelf20],antichannel"2lj,channel[ 
channel[1i4],channel[20],channel[21]) 


PROCESSOR TZ aie 

PLACE channel[32] AT linkOin: 

PLACE anticnannel[0] AT is 

PLACE antichannel[27] AT link2in: 

PLACE antichannel[8] AT link3in: 

PLACE antichannel[32] AT linkOout: 

PLACE Channel [0 | AT Een 

PLACE Channel| 27 | PA ike. 

PLACE channel[8] AT link3out: 

corner .node(12,antichannel[8],channel[32], 

antichannel[0],antichannel[27],channel[8], 
antichannel[32],channel[0],channel[27]) 


PROCESSOR 14 T4 
PLACE channel[28] AT link0in: 
PLACE antichannel[17] AT linklin: 
PLACE antichannel[31] AT link2in: 
PLACE antichannel[22] AT link3in: 
PLACE antichannel[28] AT linkOout: 
PLACE channel[17] AT linklout: 
PLACE channel[31] AT link2oub: 
PLACE channel[22] AT link ont: 
cross.node(14,antichannel[22],channel[28], 
antichannel[17],antichannel[31], 
channel[22],antichannel[28], 
channel[17],channel[31]) 
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APPENDIX E 


EXPANDABLE CHANNEL PLACEMENT 


UT 


((( define link/channel numbers - T4 


WAL linkoout IS 
Er l[inkiout IS 
RAE link2out IS 
VAL link3out IS 
VAL link0in IS 
EAS linklin IS 
PAL link2in IS 
val link3in TS 


ba) 


((( create internal mapping arrays 


wei left.to.right. 
kunk2in] : 

FEM rigqntato. leít. 
Mei'AkOin] 

PAR top.to-bottom. 
Einksin] 1 

MAL bottom,to,top, 
ink in] 

VAL 
B'unk2out,linklout, 
VAL 
EEInkOout,link3out, 
VAL 
EED: out link2out, 
VAL 

@binklout, linkoout, 


O: 
d: 
DR 
3: 
4: 
Ss 
6: 
que 
in Ba pian, linksın, 
T TEE. ink2in, linklin, 
in ToS iniciando Linkoan, 
in TS nk LINKZIN, 


rene to right out 
AE OUE] inkOOUE ] 
right.to.left.out 
IE IK OUE Vink2out | 
COP to boron. out 
linkoout,linklout™] 
O i Om O COD. Ou 
EK out link3out ] 


-- each soft channel is associated with a table 


indexed 


Hindi in, 


ane pir; 


lek? in, 


Join 


nalen 


-- when the soft channel is placed on to a hard channel. 


i) 


((( declare size structure 


VAL n IS 4: 
ET p IS n: 
Em q IS n: 


-- x dimension of array 
-- y dimension of array 


VAL nodes IS p * q: 


5 ) 
"f declare size channels 
[nodes] CHAN lertsto.richt, 


ELGG, CO. Lert: 


[nodes + 1] CHAN top.to.bottom, 
bote. io. top: 


Dez? 


147 


ES 


IS 


ds 


IS 


is 


((( node 1 
((( declaration of constants 
VACSI 17920: 


VNDE 
VAL dec.machine IS O: 


VAL left IS (dec.machine + (nodes - q)) \ nodes: 
VAL richt IS dec.machine: 

VAR borr om IS dec.machine: 

VAL top IS nodes: 

VAL mav.index IS (««GV2) *2 PENE 

11) 


ETE STE ES EN EN = cm À 
FROGEN. di la 


((( placement or channels 


PLACE teft cor igne [leie AT left.co.riol am 
[nap.index]: 

PLACE Left to rFlgne [cigni AT Jleft.to.right ow 
[map.index]: 

PLACE right. OL ere rake | AT right.to.lefvøsn 
[map.index]: 

PLACE “ight. ce. tere [left] AT right.to.left ie 
[map. index]: 

PLACE top CO. BOTTOM [top] AT CLop.to.boctomnmsm 
[map.index]: 

PLACE (top. to. bottom [bottom] AT top.to.bottom ene 
[map.index]: 

PLACE DOL CO. CO Cop [bottom] AT bottom. to. tep 
[map.index]: 

PLACE bottom tontos [top] AT 
DBOTLOM.CO.,. Top. out (map. index IE 

YY] 


node (1, left.to.right [left] left.to.right [richer 
right.to.lert [right] siet te er: 
top.to.bottom [top], top-to. bottomlMhottomil 
bottom. to.top [bottom], bottom. to. top “EA 


230) 

((( node q 

((( declaration of constants 
VAL 1 LS 0% 


Val 18 qe: 
VAL dec.machine IS a-l: 


VAL left IS (dec.machine + (nodes - q)) \ nodes: 
VAL rignt IS dec.machine: 

VAL bottom IS dec.machine: 

VAL dec.j IS (5 + (g-1 wa. 

VAL top IS dec.) + (i * q): 

VAL map. index IS ((JS2 3% 2) 212208 232 

2:313 
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EROCESSOR q T4 
((( placement of channels 


PLACE Tert- to rigot [left] Mere o AGE. In 
Ep index]: 

PLACE let toria LES] AT. letter cormighe out 
(map. index]: 

PLACE EidheE.to. left r ghi j AT else bo jer r in 
Misap.index]: 

penes rght.to.left [left] Du rie een Lett .OUt 
[map.index]: 

PLACE COP. TO. L-OETOM ETO] AT SD EO bo in 
[nap.index]: 

PLACE top.to.bottom Bosco] DUE GI sombere out 
Map. index]: 

PLACE portom: to. top [Dot rom) AT eu Om “EOS i'n 
[map.index]: 

PLACE Feo Eon co. cop COP] AT 


Ferrom.to.top.out[map.index]: 

33) 

Mee cc epic ae [lete] left.to.right [right], 
ms ccuesughtjesxght.to-left [left], 
Poor Dor com ap], top.to.bottem [bottom], 
Eon se. OP Porn | botton.to. top [top] ) 

BS? 

EA 1 IS 0: 

PLACED PAR J - 1 for (q-2) 

Un dee machine IS j + (i * q) 
VAL machine IS dec.machine + 1 


PROCESSOR machine T4 


COC evaluate indices 

VAL left IS (dec.machine + (nodes-q)) \ nodes: 
Mae E IS dec.machine: 

EE OTT IS dec.machine: 

VAL dec.j RS m W)” Aa: 

VAL top dee MTC): 

VAL map. index Te No a TL OO 


-- position of node within the  B003 
group. 


13) 
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((( placement of channels 


PLACE left EGO righe [left] AT left.to. right n 
(map.index]: 

PLACE Jer tos stone Liga) AT left.to.righwuz 
[map. index]: 

PLACE righe. Ce lert proe AT right. to. less 
Fa ree] 

PLACE rignt ror eme [left] AT right.to.lef moe 
Panes 

PLACE TOP TO HE ro kepi AT top.to. bottom 
[map indexi: 

PLACE TED ro bokse (oek Ee AT Ltop.to.botrommssa 
inem dese 

PLACE born otro [bottom] AT bottom. to Tropen 
[u So diie 

PLACE BE EER. be oD ( top] AT 
bottom to. top. eds ip das |. 

e 


node (machine, left 
HUGE. Ho. ler 
TOP ge ar 
bot com Por Pop 


PLACED PAR i = 1 FOR 


.to.right [left],left.to.right pore Ra 
[right], right.toslem MMC NM 
[top], top.Lto.boLt torus r wp 
[bottom], bottom. te Hos PE 


(P = L) 


PLACED PAR j = 0 FOR Q 
VAL dec.machine IS j + (i * G) 
VAL machine IS dec.machine + 1 
PROCESSOR machine T4 


em evaluate indices 
VAL left IS (dec.machine + (nodes-q)) * nodes: 
VAL rignt IS dec.machine: 
VAL bottom IS dec.machine: 
VAL dec.j IS (j + (9 1) E 
VAL top IS dec] M NE NC ME: 
VAL map.index IS 003 v2) 2) ETAN 
-- position of node within the (EUO 
group: 
Ge 


(ff placement of channels 


PEACE! erb co. mne [left] AT left.to.rign TEE 
[mcus ace E 

PLACE Mett, CO. EGO NE [r Ghi] AT Jleft.to.right eu 
(map. index): 

PLACE Rique. core e AT right.to.le£z 
[map.index]: 

PLACE File. to. eie [left] AT right.to.left eli 
[map. index): 

PAGE  t0p ce Ekke [Cop] AT top.to.botrém won 
[map.index]: 

PLACE COP Le ee [bottom] AT top.to. bottom enn 


aap. AE T 


1250 


PLACE oc rom He. op [Por Com] AT pot comico top.in 
ia index]: 

PLACE BEE EO. Hoffie [ top] AT 
ago EOStop.out[map.index]: 


DES 

donee machine, left to- -right [left],left.to.right [right], 
Excel ar ghetto. left [left], 
roper eo top eo bort tom [bottom], 
porer Fo pe om BOELOM.TO.LOP [top] ) 


Do 


In this appendix we start the placement from processor 01 
on. 


The placement of channels in the I/O handler is as follows: 


EN 

BELEN OF ANY leftin,rightout,antirightout,antileftin: 
GRACE leftin AT link3in: 

ERACE rigmeout AT linK3outu: 

ESACE antirignhntout AT link2out: 

PLACE antileftin AT link2in: 


Iu 
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